Lu 
cS 
= 
o 
Uv 
wv 
N 


Undergraduate Texts in Mathematics 


Editors 
S. Axler 
K.A. Ribet 


For other titles published in this series, go to 
http://www.springer.com/series/666 


John M. Harris - Jeffry L. Hirst - Michael J. Mossinghoff 


Combinatorics and 
Graph Theory 


Second Edition 


Q) Springer 


John M. Harris 

Department of Mathematics 
Furman University 
Greenville, SC 29613 

USA 

john.harris @ furman.edu 


Michael J. Mossinghoff 
Department of Mathematics 
Davidson College 

Box 6996 

Davidson, NC 28035-6996 
USA 

mimossinghoff @davidson.edu 


Editorial Board 

S. Axler 

Mathematics Department 

San Francisco State University 
San Francisco, CA 94132 
USA 

axler@sfsu.edu 


ISSN: 0172-6056 
ISBN: 978-0-387-7977 10-6 


DOI: 10.1007/978-0-387-797 11-3 


Jeffry L. Hirst 

Mathematical Sciences 
Appalachian State University 
121 Bodenheimer Dr. 

Boone, NC 28608 

USA 

jlh@math.appstate.edu 


K.A. Ribet 
Department of Mathematics 
University of California 
at Berkeley 
Berkeley, CA 94720 
USA 
ribet @ math.berkeley.edu 


e-ISBN: 978-0-387-7971 1-3 


Library of Congress Control Number: 2008934034 


Mathematics Subject Classification (2000): 05-01 03-01 


© 2008 Springer Science+Business Media, LLC 


All rights reserved. This work may not be translated or copied in whole or in part without the written 
permission of the publisher (Springer Science+Business Media, LLC, 233 Spring Street, New York, 
NY 10013, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use 
in connection with any form of information storage and retrieval, electronic adaptation, computer 
software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. 

The use in this publication of trade names, trademarks, service marks, and similar terms, even if they 
are not identified as such, is not to be taken as an expression of opinion as to whether or not they are 


subject to proprietary rights. 
Printed on acid-free paper 


springer.com 


To 
Priscilla, Sophie, and Will, 
Holly, 


Kristine, Amanda, and Alexandra 


Preface to the Second Edition 


There are certain rules that one must abide by in order to create a 
successful sequel. 
— Randy Meeks, from the trailer to Scream 2 


While we may not follow the precise rules that Mr. Meeks had in mind for suc- 
cessful sequels, we have made a number of changes to the text in this second 
edition. In the new edition, we continue to introduce new topics with concrete ex- 
amples, we provide complete proofs of almost every result, and we preserve the 
book’s friendly style and lively presentation, interspersing the text with occasional 
jokes and quotations. The first two chapters, on graph theory and combinatorics, 
remain largely independent, and may be covered in either order. Chapter 3, on 
infinite combinatorics and graphs, may also be studied independently, although 
many readers will want to investigate trees, matchings, and Ramsey theory for 
finite sets before exploring these topics for infinite sets in the third chapter. Like 
the first edition, this text is aimed at upper-division undergraduate students in 
mathematics, though others will find much of interest as well. It assumes only 
familiarity with basic proof techniques, and some experience with matrices and 
infinite series. 

The second edition offers many additional topics for use in the classroom or for 
independent study. Chapter | includes a new section covering distance and related 
notions in graphs, following an expanded introductory section. This new section 
also introduces the adjacency matrix of a graph, and describes its connection to 
important features of the graph. Another new section on trails, circuits, paths, 
and cycles treats several problems regarding Hamiltonian and Eulerian paths in 
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graphs, and describes some elementary open problems regarding paths in graphs, 
and graphs with forbidden subgraphs. 

Several topics were added to Chapter 2. The introductory section on basic 
counting principles has been expanded. Early in the chapter, a new section covers 
multinomial coefficients and their properties, following the development of the 
binomial coefficients. Another new section treats the pigeonhole principle, with 
applications to some problems in number theory. The material on Polya’s theory 
of counting has now been expanded to cover de Bruijn’s more general method of 
counting arrangements in the presence of one symmetry group acting on the ob- 
jects, and another acting on the set of allowed colors. A new section has also been 
added on partitions, and the treatment of Eulerian numbers has been significantly 
expanded. The topic of stable marriage is developed further as well, with three 
interesting variations on the basic problem now covered here. Finally, the end 
of the chapter features a new section on combinatorial geometry. Two principal 
problems serve to introduce this rich area: a nice problem of Sylvester’s regard- 
ing lines produced by a set of points in the plane, and the beautiful geometric 
approach to Ramsey theory pioneered by Erdés and Szekeres in a problem about 
the existence of convex polygons among finite sets of points in the plane. 

In Chapter 3, a new section develops the theory of matchings further by in- 
vestigating marriage problems on infinite sets, both countable and uncountable. 
Another new section toward the end of this chapter describes a characterization 
of certain large infinite cardinals by using linear orderings. Many new exercises 
have also been added in each chapter, and the list of references has been com- 
pletely updated. 

The second edition grew out of our experiences teaching courses in graph the- 
ory, combinatorics, and set theory at Appalachian State University, Davidson Col- 
lege, and Furman University, and we thank these institutions for their support, and 
our students for their comments. We also thank Mark Spencer at Springer-Verlag. 
Finally, we thank our families for their patience and constant good humor through- 
out this process. The first and third authors would also like to add that, since the 
original publication of this book, their families have both gained their own second 
additions! 


May 2008 John M. Harris 
Jeffry L. Hirst 
Michael J. Mossinghoff 


Preface to the First Edition 


Three things should be considered: problems, theorems, and 
applications. 
— Gottfried Wilhelm Leibniz, 
Dissertatio de Arte Combinatoria, 1666 


This book grew out of several courses in combinatorics and graph theory given at 
Appalachian State University and UCLA in recent years. A one-semester course 
for juniors at Appalachian State University focusing on graph theory covered most 
of Chapter | and the first part of Chapter 2. A one-quarter course at UCLA on 
combinatorics for undergraduates concentrated on the topics in Chapter 2 and 
included some parts of Chapter 1. Another semester course at Appalachian State 
for advanced undergraduates and beginning graduate students covered most of the 
topics from all three chapters. 

There are rather few prerequisites for this text. We assume some familiarity 
with basic proof techniques, like induction. A few topics in Chapter | assume 
some prior exposure to elementary linear algebra. Chapter 2 assumes some famil- 
larity with sequences and series, especially Maclaurin series, at the level typically 
covered in a first-year calculus course. The text requires no prior experience with 
more advanced subjects, such as group theory. 

While this book is primarily intended for upper-division undergraduate stu- 
dents, we believe that others will find it useful as well. Lower-division undergrad- 
uates with a penchant for proofs, and even talented high school students, will be 
able to follow much of the material, and graduate students looking for an intro- 
duction to topics in graph theory, combinatorics, and set theory may find several 
topics of interest. 
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Chapter | focuses on the theory of finite graphs. The first section serves as an 
introduction to basic terminology and concepts. Each of the following sections 
presents a specific branch of graph theory: trees, planarity, coloring, matchings, 
and Ramsey theory. These five topics were chosen for two reasons. First, they 
represent a broad range of the subfields of graph theory, and in turn they provide 
the reader with a sound introduction to the subject. Second, and just as important, 
these topics relate particularly well to topics in Chapters 2 and 3. 

Chapter 2 develops the central techniques of enumerative combinatorics: the 
principle of inclusion and exclusion, the theory and application of generating 
functions, the solution of recurrence relations, Pélya’s theory of counting arrange- 
ments in the presence of symmetry, and important classes of numbers, including 
the Fibonacci, Catalan, Stirling, Bell, and Eulerian numbers. The final section in 
the chapter continues the theme of matchings begun in Chapter | with a consider- 
ation of the stable marriage problem and the Gale—Shapley algorithm for solving 
it. 

Chapter 3 presents infinite pigeonhole principles, K6nig’s Lemma, Ramsey’s 
Theorem, and their connections to set theory. The systems of distinct representa- 
tives of Chapter | reappear in infinite form, linked to the axiom of choice. Count- 
ing is recast as cardinal arithmetic, and a pigeonhole property for cardinals leads 
to discussions of incompleteness and large cardinals. The last sections connect 
large cardinals to finite combinatorics and describe supplementary material on 
computability. 

Following Leibniz’s advice, we focus on problems, theorems, and applications 
throughout the text. We supply proofs of almost every theorem presented. We 
try to introduce each topic with an application or a concrete interpretation, and 
we often introduce more applications in the exercises at the end of each section. 
In addition, we believe that mathematics is a fun and lively subject, so we have 
tried to enliven our presentation with an occasional joke or (we hope) interesting 
quotation. 

We would like to thank the Department of Mathematical Sciences at Appala- 
chian State University and the Department of Mathematics at UCLA. We would 
especially like to thank our students (in particular, Jae-I] Shin at UCLA), whose 
questions and comments on preliminary versions of this text helped us to improve 
it. We would also like to thank the three anonymous reviewers, whose suggestions 
helped to shape this book into its present form. We also thank Sharon McPeake, 
a student at ASU, for her rendering of the K6nigsberg bridges. 

In addition, the first author would like to thank Ron Gould, his graduate advi- 
sor at Emory University, for teaching him the methods and the joys of studying 
graphs, and for continuing to be his advisor even after graduation. He especially 
wants to thank his wife, Priscilla, for being his perfect match, and his daughter 
Sophie for adding color and brightness to each and every day. Their patience and 
support throughout this process have been immeasurable. 

The second author would like to thank Judith Roitman, who introduced him to 
set theory and Ramsey’s Theorem at the University of Kansas, using an early draft 
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of her fine text. Also, he would like to thank his wife, Holly (the other Professor 
Hirst), for having the infinite tolerance that sets her apart from the norm. 

The third author would like to thank Bob Blakley, from whom he first learned 
about combinatorics as an undergraduate at Texas A & M University, and Don- 
ald Knuth, whose class Concrete Mathematics at Stanford University taught him 
much more about the subject. Most of all, he would like to thank his wife, Kris- 
tine, for her constant support and infinite patience throughout the gestation of this 
project, and for being someone he can always, well, count on. 


September 1999 John M. Harris 
Jeffry L. Hirst 
Michael J. Mossinghoff 
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1 
Graph Theory 


“Begin at the beginning,” the King said, gravely, “and go on till you 
come to the end; then stop.” 
— Lewis Carroll, Alice in Wonderland 


The Pregolya River passes through a city once known as KOnigsberg. In the 1700s 
seven bridges were situated across this river in a manner similar to what you see 
in Figure 1.1. The city’s residents enjoyed strolling on these bridges, but, as hard 
as they tried, no resident of the city was ever able to walk a route that crossed each 
of these bridges exactly once. The Swiss mathematician Leonhard Euler learned 
of this frustrating phenomenon, and in 1736 he wrote an article [98] about it. 
His work on the “K6nigsberg Bridge Problem” is considered by many to be the 
beginning of the field of graph theory. 


FIGURE 1.1. The bridges in K6nigsberg. 
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At first, the usefulness of Euler’s ideas and of “graph theory” itself was found 
only in solving puzzles and in analyzing games and other recreations. In the mid 
1800s, however, people began to realize that graphs could be used to model many 
things that were of interest in society. For instance, the “Four Color Map Conjec- 
ture,” introduced by DeMorgan in 1852, was a famous problem that was seem- 
ingly unrelated to graph theory. The conjecture stated that four is the maximum 
number of colors required to color any map where bordering regions are colored 
differently. This conjecture can easily be phrased in terms of graph theory, and 
many researchers used this approach during the dozen decades that the problem 
remained unsolved. 

The field of graph theory began to blossom in the twentieth century as more 
and more modeling possibilities were recognized — and the growth continues. It 
is interesting to note that as specific applications have increased in number and in 
scope, the theory itself has developed beautifully as well. 

In Chapter | we investigate some of the major concepts and applications of 
graph theory. Keep your eyes open for the Konigsberg Bridge Problem and the 
Four Color Problem, for we will encounter them along the way. 


1.1 Introductory Concepts 


A definition is the enclosing a wilderness of idea within a wall of 
words. 
— Samuel Butler, Higgledy-Piggledy 


1.1.1. Graphs and Their Relatives 


A graph consists of two finite sets, V and . Each element of V is called a vertex 
(plural vertices). The elements of EF, called edges, are unordered pairs of vertices. 
For instance, the set V might be {a,b,c,d,e, f,g,h}, and E might be {{a, d}, 
{a,e}, {b,c}, {b, e}, {b, g}, {c, fh. {d, f}. {d, 9g}, {g, h}}. Together, V and £ 
are a graph G. 

Graphs have natural visual representations. Look at the diagram in Figure 1.2. 
Notice that each element of V is represented by a small circle and that each ele- 
ment of F is represented by a line drawn between the corresponding two elements 
of V. 


e i g h 
FIGURE 1.2. A visual representation of the graph G. 
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As a matter of fact, we can just as easily define a graph to be a diagram consist- 
ing of small circles, called vertices, and curves, called edges, where each curve 
connects two of the circles together. When we speak of a graph in this chapter, we 
will almost always refer to such a diagram. 

We can obtain similar structures by altering our definition in various ways. Here 
are some examples. 


1. By replacing our set F with a set of ordered pairs of vertices, we obtain 
a directed graph, or digraph (Figure 1.3). Each edge of a digraph has a 
specific orientation. 


FIGURE 1.3. A digraph. 


2. If we allow repeated elements in our set of edges, technically replacing our 
set / with a multiset, we obtain a multigraph (Figure 1.4). 


FIGURE 1.4. A multigraph. 


3. By allowing edges to connect a vertex to itself (“loops’’), we obtain a pseu- 
dograph (Figure 1.5). 


FIGURE 1.5. A pseudograph. 
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4. Allowing our edges to be arbitrary subsets of vertices (rather than just pairs) 
gives us hypergraphs (Figure 1.6). 


FIGURE 1.6. A hypergraph with 7 vertices and 5 edges. 


5. By allowing V or EF to be an infinite set, we obtain infinite graphs. Infinite 
graphs are studied in Chapter 3. 


In this chapter we will focus on finite, simple graphs: those without loops or 
multiple edges. 


Exercises 


1. Ten people are seated around a circular table. Each person shakes hands 
with everyone at the table except the person sitting directly across the table. 
Draw a graph that models this situation. 


2. Six fraternity brothers (Adam, Bert, Chuck, Doug, Ernie, and Filthy Frank) 
need to pair off as roommates for the upcoming school year. Each person 
has compiled a list of the people with whom he would be willing to share a 
room. 


Adam’s list: Doug 

Bert’s list: Adam, Ernie 
Chuck’s list: Doug, Ernie 
Doug’s list: Chuck 
Ernie’s list: Ernie 
Frank’s list: Adam, Bert 


Draw a digraph that models this situation. 


3. There are twelve women’s basketball teams in the Atlantic Coast Confer- 
ence: Boston College (B), Clemson (C), Duke (D), Florida State (F), Geor- 
gia Tech (G), Miami (1), NC State (S), Univ. of Maryland (M), Univ. of 
North Carolina (N), Univ. of Virginia (V), Virginia Tech (T), and Wake 
Forest Univ. (W). At a certain point in midseason, 


B has played I, T*, W 
C has played D*, G 
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D has played C*, S, W 
F has played N*, V 

G has played C, M 
Thas played B, M, T 

S has played D, V* 

M has played G, I, N 
N has played F*, M, W 
V has played F, S* 

T has played B*, I 

W has played B, D, N 
The asterisk(*) indicates that these teams have played each other twice. 


Draw a multigraph that models this situation. 


4. Can you explain why no resident of KOnigsberg was ever able to walk a 
route that crossed each bridge exactly once? (We will encounter this ques- 
tion again in Section 1.4.1.) 


1.1.2 The Basics 


Your first discipline is your vocabulary; 
— Robert Frost 


In this section we will introduce a number of basic graph theory terms and 
concepts. Study them carefully and pay special attention to the examples that are 
provided. Our work together in the sections that follow will be enriched by a solid 
understanding of these ideas. 


The Very Basics 


The vertex set of a graph G is denoted by V(G), and the edge set is denoted 
by E(G). We may refer to these sets simply as V and F if the context makes the 
particular graph clear. For notational convenience, instead of representing an edge 
as {u,v}, we denote this simply by wv. The order of a graph G is the cardinality 
of its vertex set, and the size of a graph is the cardinality of its edge set. 

Given two vertices u and v, if wv € FE, then u and v are said to be adjacent. In 
this case, u and v are said to be the end vertices of the edge uv. If wv ¢ E, then u 
and v are nonadjacent. Furthermore, if an edge e has a vertex v as an end vertex, 
we say that v is incident with e. 

The neighborhood (or open neighborhood) of a vertex v, denoted by N(v), is 
the set of vertices adjacent to v: 


N(v) ={«@ €V | va € E}. 
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The closed neighborhood of a vertex v, denoted by N |v], is simply the set {v} U 
N(v). Given a set S of vertices, we define the neighborhood of S, denoted by 
N(S), to be the union of the neighborhoods of the vertices in S. Similarly, the 
closed neighborhood of 5’, denoted N[S], is defined to be SU N(S). 

The degree of v, denoted by deg(v), is the number of edges incident with v. In 
simple graphs, this is the same as the cardinality of the (open) neighborhood of v. 
The maximum degree of a graph G, denoted by A(G), is defined to be 


A(G) = max{deg(v) | v € V(G)}. 
Similarly, the minimum degree of a graph G, denoted by 6(G), is defined to be 
6(G) = min{deg(v) | v € V(G)}. 


The degree sequence of a graph of order n is the n-term sequence (usually written 
in descending order) of the vertex degrees. 

Let’s use the graph G in Figure 1.2 to illustrate some of these concepts: G 
has order 8 and size 9; vertices a and e are adjacent while vertices a and b are 
nonadjacent; N(d) = {a, f,g}, N[d] = {a,d, f,g}; A(G) = 3, 6(G) = 1; and 
the degree sequence is 3, 3, 3, 2, 2, 2, 2, 1. 

The following theorem is often referred to as the First Theorem of Graph The- 
ory. 


Theorem 1.1. /n a graph G, the sum of the degrees of the vertices is equal to 
twice the number of edges. Consequently, the number of vertices with odd degree 
is even. 


Proof. Let S = >) <y deg(v). Notice that in counting S, we count each edge 
exactly twice. Thus, S = 2|F| (the sum of the degrees is twice the number of 
edges). Since S is even, it must be that the number of vertices with odd degree is 
even. oO 


Perambulation and Connectivity 


A walk in a graph is a sequence of (not necessarily distinct) vertices v1, v2,..., Ux 
such that v;u;41 € FE fori = 1,2,...,k —1. Such a walk is sometimes called 
a vy—vz walk, and v; and vz are the end vertices of the walk. If the vertices in a 
walk are distinct, then the walk is called a path. If the edges in a walk are distinct, 
then the walk is called a trail. In this way, every path is a trail, but not every trail 
is a path. Got it? 

A closed path, or cycle, is a path v1,..., vx (where k > 3) together with the 
edge v;,v;. Similarly, a trail that begins and ends at the same vertex is called a 
closed trail, or circuit. The length of a walk (or path, or trail, or cycle, or circuit) 
is its number of edges, counting repetitions. 

Once again, let’s illustrate these definitions with an example. In the graph of 
Figure 1.7, a,c, f, c, b, dis a walk of length 5. The sequence ), a, c, b, d represents 
a trail of length 4, and the sequence d, g, b, a, c, f, e represents a path of length 6. 
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a 


i es 
FA 
<a 


FIGURE 1.7. 


Also, g, d, b, c, a, b, g is a circuit, while e, d, b, a, c, f, e is a cycle. In general, it 
is possible for a walk, trail, or path to have length 0, but the least possible length 
of a circuit or cycle is 3. 

The following theorem is often referred to as the Second Theorem in this book. 


Theorem 1.2. Ina graph G with vertices u and v, every u—v walk contains a u-v 
path. 


Proof. Let W be a u—v walk in G. We prove this theorem by induction on the 
length of W. If W is of length | or 2, then it is easy to see that W must be a path. 
For the induction hypothesis, suppose the result is true for all walks of length less 
than k, and suppose W has length k. Say that W is 


U = Wo, W1,W2,--+,Wk-1, Wk = U 


where the vertices are not necessarily distinct. If the vertices are in fact distinct, 
then W itself is the desired u—v path. If not, then let 7 be the smallest integer such 
that w; = w, for some r > 7. Let W, be the walk 


U = W0,-+ +, Wi, Wrti,-++-,Wk = UV. 


This walk has length strictly less than /, and therefore the induction hypothesis 
implies that W; contains a u—v path. This means that W contains a u—v path, and 
the proof is complete. O 


We now introduce two different operations on graphs: vertex deletion and edge 
deletion. Given a graph G and a vertex v € V(G), we let G — v denote the graph 
obtained by removing v and all edges incident with v from G. If S is a set of 
vertices, we let G — S denote the graph obtained by removing each vertex of S' 
and all associated incident edges. If e is an edge of G, then G — e is the graph 
obtained by removing only the edge e (its end vertices stay). If Tis a set of edges, 
then G' — T is the graph obtained by deleting each edge of T' from G. Figure 1.8 
gives examples of these operations. 

A graph is connected if every pair of vertices can be joined by a path. Infor- 
mally, if one can pick up an entire graph by grabbing just one vertex, then the 
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-d ae G- cd 
: 4 stra) t G - {eg, fe} 
FIGURE 1.8. Deletion operations. 


“~ 


G G. Gc 


FIGURE 1.9. Connected and disconnected graphs. 


graph is connected. In Figure 1.9, G1 is connected, and both G2 and G3 are not 
connected (or disconnected). Each maximal connected piece of a graph is called 
a connected component. In Figure 1.9, G; has one component, G2 has three com- 
ponents, and G3 has two components. 

If the deletion of a vertex v from G causes the number of components to in- 
crease, then v is called a cut vertex. In the graph G of Figure 1.8, vertex d is a cut 
vertex and vertex c is not. Similarly, an edge e in G is said to be a bridge if the 
graph G' — e has more components than G. In Figure 1.8, the edge ab is the only 
bridge. 

A proper subset S of vertices of a graph G is called a vertex cut set (or simply, 
a cut set) if the graph G — S is disconnected. A graph is said to be complete if 
every vertex is adjacent to every other vertex. Consequently, if a graph contains at 
least one nonadjacent pair of vertices, then that graph is not complete. Complete 
graphs do not have any cut sets, since G — S is connected for all proper subsets S' 
of the vertex set. Every non-complete graph has a cut set, though, and this leads 
us to another definition. For a graph G which is not complete, the connectivity 
of G, denoted «(G), is the minimum size of a cut set of G. If G is a connected, 
non-complete graph of order n, then 1 < K(G) < n — 2. If G is disconnected, 
then «(G) = 0. If G is complete of order n, then we say that K(G) = n — 1. 
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Further, for a positive integer k, we say that a graph is k-connected if k < «(G). 
You will note here that “1-connected” simply means “connected.” 

Here are several facts that follow from these definitions. You will get to prove 
a couple of them in the exercises. 


i. A graph is connected if and only if «(G) > 1. 
ii. «(G) > 2 if and only if G is connected and has no cut vertices. 
iii. Every 2-connected graph contains at least one cycle. 
iv. For every graph G, K(G) < 6(G). 
Exercises 
1. If Gis a graph of order n, what is the maximum number of edges in G? 


2. Prove that for any graph G of order at least 2, the degree sequence has at 
least one pair of repeated entries. 


3. Consider the graph shown in Figure 1.10. 


a 


FIGURE 1.10. 


(a) How many different paths have c as an end vertex? 
(b) How many different paths avoid vertex c altogether? 


(c) What is the maximum length of a circuit in this graph? Give an exam- 
ple of such a circuit. 


(d) What is the maximum length of a circuit that does not include vertex 
c? Give an example of such a circuit. 


4. Is it true that a finite graph having exactly two vertices of odd degree must 
contain a path from one to the other? Give a proof or a counterexample. 


5. Let G be a graph where 6(G) > k. 


(a) Prove that G has a path of length at least k. 
(b) If k > 2, prove that G has a cycle of length at least / + 1. 


10 


11. 
12. 


13. 


14. 
15. 


16. 
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. Prove that every closed odd walk in a graph contains an odd cycle. 


. Draw a connected graph having at most 10 vertices that has at least one 


cycle of each length from 5 through 9, but has no cycles of any other length. 


. Let P; and P2 be two paths of maximum length in a connected graph G. 


Prove that P, and P2 have a common vertex. 


. Let G be a graph of order n that is not connected. What is the maximum 


size of G? 


. Let G be a graph of order n and size strictly less than n — 1. Prove that G 


is not connected. 
Prove that an edge e is a bridge of G if and only if e lies on no cycle of G. 
Prove or disprove each of the following statements. 


(a) If G has no bridges, then G has exactly one cycle. 
(b) If G has no cut vertices, then G has no bridges. 
(c) If G has no bridges, then G has no cut vertices. 


Prove or disprove: If every vertex of a connected graph G lies on at least 
one cycle, then G is 2-connected. 


Prove that every 2-connected graph contains at least one cycle. 
Prove that for every graph G, 

(a) K(G) < 6(G); 

(b) if 0(G) > n — 2, then K(G) = 6(G). 
Let G be a graph of order n. 


(a) If d(G) > 4, then prove that G is connected. 
(b) If 6(G) > 4, then show that G need not be connected. 


1.1.3 Special Types of Graphs 


until we meet again... 
— from An Irish Blessing 


In this section we describe several types of graphs. We will run into many of them 
later in the chapter. 


1. 


Complete Graphs 


We introduced complete graphs in the previous section. A complete graph 
of order n is denoted by K’,,, and there are several examples in Figure 1.11. 
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K3 K; Ky 
FIGURE 1.11. Examples of complete graphs. 


2. Empty Graphs 


The empty graph on n vertices, denoted by F,,, is the graph of order n 
where F is the empty set (Figure 1.12). 


FIGURE 1.12. An empty graph. 


3. Complements 


Given a graph G, the complement of G, denoted by G, is the graph whose 
vertex set is the same as that of G, and whose edge set consists of all the 
edges that are not present in G (Figure 1.13). 


G G 


FIGURE 1.13. A graph and its complement. 


4. Regular Graphs 


A graph G is regular if every vertex has the same degree. G is said to be 
regular of degree r (or r-regular) if deg(v) = r for all vertices v in G. 
Complete graphs of order n are regular of degree n — 1, and empty graphs 
are regular of degree 0. Two further examples are shown in Figure 1.14. 
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FIGURE 1.14. Examples of regular graphs. 


5. Cycles 
The graph C,, is simply a cycle on n vertices (Figure 1.15). 


FIGURE 1.15. The graph Cy. 


6. Paths 
The graph P,, is simply a path on n vertices (Figure 1.16). 


ee __e_e__e_-® 
FIGURE 1.16. The graph P¢. 


7. Subgraphs 


A graph H is a subgraph of a graph G if V(H) C V(G) and E(H) C 
E(G). In this case we write H C G, and we say that G contains H. In 
a graph where the vertices and edges are unlabeled, we say that H C G 
if the vertices could be labeled in such a way that V(H) C V(G) and 
E(H) C E(G). In Figure 1.17, H; and Hp are both subgraphs of G, but 
Hz is not. 


8. Induced Subgraphs 


Given a graph G anda subset S of the vertex set, the subgraph of G induced 
by S', denoted (S’), is the subgraph with vertex set S and with edge set 
{uv | u,v € S and uv € E(G)}. So, (S) contains all vertices of S' and 
all edges of G whose end vertices are both in S. A graph and two of its 
induced subgraphs are shown in Figure 1.18. 
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A, 


« & 


A, 


FIGURE 1.17. Hi and H2 are subgraphs of G. 


b c 
ADE le As 
g i e 
f e 
FIGURE 1.18. A graph and two of its induced subgraphs. 


9. Bipartite Graphs 


A graph G is bipartite if its vertex set can be partitioned into two sets X 
and Y in such a way that every edge of G has one end vertex in X and the 
other in Y. In this case, X and Y are called the partite sets. The first two 
graphs in Figure 1.19 are bipartite. Since it is not possible to partition the 
vertices of the third graph into two such sets, the third graph is not bipartite. 


Ee 


FIGURE 1.19. Two bipartite graphs and one non-bipartite graph. 


A bipartite graph with partite sets X and Y is called a complete bipartite 
graph if its edge set is of the form E = {ay | « € X,y € Y} (that is, if 
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every possible connection of a vertex of X with a vertex of Y is present in 
the graph). Such a graph is denoted by Kx, ;y;. See Figure 1.20. 


K,; K, 4 Ky, 


FIGURE 1.20. A few complete bipartite graphs. 


The next theorem gives an interesting characterization of bipartite graphs. 


Theorem 1.3. A graph with at least two vertices is bipartite if and only if it 
contains no odd cycles. 


Proof. Let G be a bipartite graph with partite sets X and Y. Let C be a cycle 
of G and say that C is v1, v2,..., Ug, V1. Assume without loss of generality that 
v1, € X. The nature of bipartite graphs implies then that v; € X for all odd 2, and 
vu; € Y for all even i. Since vz is adjacent to v;, it must be that & is even; and 
hence Cis an even cycle. 

For the reverse direction of the theorem, let G be a graph of order at least two 
such that G contains no odd cycles. Without loss of generality, we can assume 
that G is connected, for if not, we could treat each of its connected components 
separately. Let v be a vertex of G, and define the set X to be 


X = {x € V(G) | the shortest path from x to v has even length}, 


and let Y = V(G) \ X. 

Now let x and x’ be vertices of X, and suppose that x and x’ are adjacent. If 
x = v, then the shortest path from v to 2’ has length one. But this implies that 
x’ € Y, a contradiction. So, it must be that x 4 v, and by a similar argument, 
x’ # v. Let P; be a path from v to x of shortest length (a shortest v—a path) and 
let Pz be a shortest v—2’ path. Say that P, is v = vo, V1,..., Vak = x and that P, 
is U = Wo, W1,-.., Woz = x’. The paths P, and P» certainly have v in common. 
Let vu’ be a vertex on both paths such that the v’—a path, call it P/, and the v’—2’ 
path, call it P3, have only the vertex v’ in common. Essentially, v’ is the “last” 
vertex common to P; and P». It must be that P/ and P3 are shortest v’—a and 
v'—x’ paths, respectively, and it must be that v’ = v; = w; for some i. But since 
x and x’ are adjacent, vj, Vi41,..., Vok, Wat, Wot-1,---, W; is accycle of length 
(2k — 7) + (2t — 7) +1, which is odd, and that is a contradiction. 

Thus, no two vertices in X are adjacent to each other, and a similar argument 
shows that no two vertices in Y are adjacent to each other. Therefore, G is bipartite 
with partite sets X and Y. O 
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We conclude this section with a discussion of what it means for two graphs 


to be the same. Look closely at the graphs in Figure 1.21 and convince yourself 
that one could be re-drawn to look just like the other. Even though these graphs 


b 1 2 


8 4 
7 4 
6 5 


h 


FIGURE 1.21. Are these graphs the same? 


have different vertex sets and are drawn differently, it is still quite natural to think 
of these graphs as being the same. The idea of isomorphism formalizes this phe- 
nomenon. 

Graphs G and H are said to be isomorphic to one another (or simply, isomor- 
phic) if there exists a one-to-one correspondence f : V(G) — V(#) such that 
for each pair x,y of vertices of G, xy € E(G) if and only if f(x) f(y) € E(A). 
In other words, G' and H are isomorphic if there exists a mapping from one vertex 
set to another that preserves adjacencies. The mapping itself is called an isomor- 
phism. In our example, such an isomorphism could be described as follows: 


{(a, 1), (6,2), (¢,8), (4,3), (e, 7), (f,4), (9, 6), (2, 5)}. 


When two graphs G and H are isomorphic, it is not uncommon to simply say that 
G = FH or that “G is H.” As you will see, we will make use of this convention 
quite often in the sections that follow. 

Several facts about isomorphic graphs are immediate. First, if G and H are 
isomorphic, then |V(G)| = |V(#)| and |E(G)| = |E(1)|. The converse of this 
statement is not true, though, and you can see that in the graphs of Figure 1.22. 
The vertex and edge counts are the same, but the two graphs are clearly not iso- 


FIGURE 1.22. 
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morphic. 

A second necessary fact is that if G and H are isomorphic then the degree 
sequences must be identical. Again, the graphs in Figure 1.22 show that the con- 
verse of this statement is not true. A third fact, and one that you will prove in 
Exercise 8, is that if graphs G and H are isomorphic, then their complements G 
and H must also be isomorphic. 

In general, determining whether two graphs are isomorphic is a difficult prob- 
lem. While the question is simple for small graphs and for pairs where the ver- 
tex counts, edge counts, or degree sequences differ, the general problem is often 
tricky to solve. A common strategy, and one you might find helpful in Exercises 9 
and 10, is to compare subgraphs, complements, or the degrees of adjacent pairs 
of vertices. 


Exercises 
1. Forn > 1, prove that K,, has n(n — 1)/2 edges. 
2. If K;,,r. is regular, prove that r] = ro. 


3. Determine whether /‘4 is a subgraph of 4,4. If yes, then exhibit it. If no, 
then explain why not. 


4. Determine whether P, is an induced subgraph of v4.4. If yes, then exhibit 
it. If no, then explain why not. 


5. List all of the unlabeled connected subgraphs of C'34. 


6. The concept of complete bipartite graphs can be generalized to define the 
complete multipartite graph Ky, r5,...,.r,- This graph consists of k sets of 
vertices A;, Ao, ..., Ax, with |A;| = r; for each i, where all possible 
“interset edges” are present and no “intraset edges” are present. Find ex- 
pressions for the order and size of Ky, 73... .ry- 


7. The line graph L(G) of a graph G is defined in the following way: the 
vertices of L(G) are the edges of G, V(L(G)) = E(G), and two vertices 
in L(G) are adjacent if and only if the corresponding edges in G' share a 
vertex. 


(a) Let G be the graph shown in Figure 1.23. Find L(G). 


FIGURE 1.23. 
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(b) Find the complement of L(K‘5). 


(c) Suppose G has n vertices, labeled vj), ...v,, and the degree of vertex 
v; is r;. Let m denote the size of G, sory +ro+---+1Tn = 2m. Find 
formulas for the order and size of L(G) in terms of n, m, and the 7;. 


8. Prove that if graphs G and H are isomorphic, then their complements G 
and H are also isomorphic. 


9. Prove that the two graphs in Figure 1.24 are not isomorphic. 


gh 


FIGURE 1.24. 


10. Two of the graphs in Figure 1.25 are isomorphic. 


Ww 


FIGURE 1.25. 


(a) For the pair that is isomorphic, give an appropriate one-to-one corre- 
spondence. 


(b) Prove that the remaining graph is not isomporhic to the other two. 


1.2. Distance in Graphs 


‘Tis distance lends enchantment to the view... 
— Thomas Campbell, The Pleasures of Hope 


How far is it from one vertex to another? In this section we define distance in 
graphs, and we consider several properties, interpretations, and applications. Dis- 
tance functions, called metrics, are used in many different areas of mathematics, 
and they have three defining properties. If // is a metric, then 
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i. M(ax,y) > 0 forall x, y, and M(x, y) = 0 if and only if « = y; 
ii, M(x, y) = M(y, 2) for all x, y; 
iii, M(x, y) < M(x, z) + M(z, y) forall x, y, z. 


As you encounter the distance concept in the graph sense, verify for yourself that 
the function is in fact a metric. 


1.2.1 Definitions and a Few Properties 


I prefer the term ‘eccentric.’ 
— Brenda Bates, Urban Legend 


Distance in graphs is defined in a natural way: in a connected graph G, the dis- 
tance from vertex u to vertex v is the length (number of edges) of a shortest u—v 
path in G. We denote this distance by d(u,v), and in situations where clarity of 
context is important, we may write d¢(u, v). In Figure 1.26, d(b,k) = 4 and 
d(c,m) = 6. 


FIGURE 1.26. 


For a given vertex v of a connected graph, the eccentricity of v, denoted ecc(v), 
is defined to be the greatest distance from v to any other vertex. That is, 


ecc(v) = pe {d(v,x)}. 
In Figure 1.26, ecc(a) = 5 since the farthest vertices from a (namely k, m, n) are 
at a distance of 5 from a. 

Of the vertices in this graph, vertices c, k,m and n have the greatest eccentricity 
(6), and vertices e, f and g have the smallest eccentricity (3). These values and 
types of vertices are given special names. In a connected graph G, the radius of G, 
denoted rad(G), is the value of the smallest eccentricity. Similarly, the diameter 
of G, denoted diam(G), is the value of the greatest eccentricity. The center of the 
graph G is the set of vertices, v, such that ecc(v) = rad(G). The periphery of G 
is the set of vertices, wu, such that ecc(u) = diam(G). In Figure 1.26, the radius 
is 3, the diameter is 6, and the center and periphery of the graph are, respectively, 


12; fs gt and 1G k, ™m, n}. 
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Surely these terms sound familiar to you. On a disk, the farthest one can travel 
from one point to another is the disk’s diameter. Points on the rim of a disk are on 
the periphery. The distance from the center of the disk to any other point on the 
disk is at most the radius. The terms for graphs have similar meanings. 

Do not be misled by this similarity, however. You may have noticed that the 
diameter of our graph G is twice the radius of G. While this does seem to be a 
natural relationship, such is not the case for all graphs. Take a quick look at a 
cycle or a complete graph. For either of these graphs, the radius and diameter are 
equal! 

The following theorem describes the proper relationship between the radii and 
diameters of graphs. While not as natural, tight, or “circle-like” as you might 
hope, this relationship does have the advantage of being correct. 


Theorem 1.4. For any connected graph G, rad(G) < diam(G) < 2 rad(G). 


Proof. By definition, rad(G) < diam(G), so we just need to prove the second 
inequality. Let u and v be vertices in G such that d(u, v) = diam(G). Further, let 
c be a vertex in the center of G. Then, 


diam(G) = d(u,v) < d(u,c) + d(c,v) < 2ecc(c) = 2 rad(G). Oo 


The definitions in this section can also be extended to graphs that are not con- 
nected. In the context of a single connected component of a disconnected graph, 
these terms have their normal meanings. If two vertices are in different compo- 
nents, however, we say that the distance between them is infinity. 

We conclude this section with two interesting results. Choose your favorite 
graph. It can be large or small, dense with edges or sparse. Choose anything you 
like, as long as it is your favorite. Now, wouldn’t it be neat if there existed a graph 
in which your favorite graph was the “center” of attention? The next theorem 
(credited to Hedetneimi in [44]) makes your wish come true. 


Theorem 1.5. Every graph is (isomorphic to) the center of some graph. 


Proof. Let G be a graph (your favorite!). We now construct a new graph H (see 
Figure 1.27) by adding four vertices (w, x, y, z) to G, along with the following 
edges: 

{wz,yz}U {aa| ae V(G)}U {yb | be V(G)}. 


Now, ecc(w) = ecc(z) = 4, ecc(y) = ecc(x) = 3, and for any vertex v € V(G), 


"| Se 


FIGURE 1.27. G is the center. 


ecc(v) = 2. Therefore, G is the center of H. O 
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Suppose you don’t like being the center of attention. Maybe you would rather 
your favorite graph avoid the spotlight and stay on the periphery. The next theorem 
(due to Bielak and Systo, [25]) tells us when that can happen. 


Theorem 1.6. A graph G is (isomorphic to) the periphery of some graph if and 
only if either every vertex has eccentricity 1, or no vertex has eccentricity 1. 


Proof. Suppose that every vertex of G' has eccentricity 1. Not only does this mean 
that G' is complete, it also means that every vertex of G is in the periphery. G is 
the periphery of itself! 

On the other hand, suppose that no vertex of G has eccentricity 1. This means 
that for every vertex u of G, there is some vertex v of G such that wv ¢ E(G). 
Now, let H be a new graph, constructed by adding a single vertex, w, to G, to- 
gether with the edges {wax | « € V(G)}. In the graph H, the eccentricity of w is 
1 (w is adjacent to everything). Further, for any vertex x € V(G), the eccentricity 
of x in H is 2 (no vertex of G is adjacent to everything else in G, and everything 
in G is adjacent to w). Thus, the periphery of H is precisely the vertices of G. 

For the reverse direction, let us suppose that G has some vertices of eccentricity 
1 and some vertices of eccentricity greater than 1. Suppose also (in anticipation 
of a contradiction) that G forms the periphery of some graph, say H. Since the 
eccentricities of the vertices in G' are not all the same, it must be that V(G) is 
a proper subset of V(H). This means that H is not the periphery of itself and 
that diam(H) > 2. Now, let v be a vertex of G whose eccentricity in G is 1 (v 
is therefore adjacent to all vertices of G). Since v € V(G) and since G is the 
periphery of H, there exists a vertex w in H such that d(v, w) = diam(H) > 2. 
The vertex w, then, is also a peripheral vertex (see Exercise 4) and therefore must 
be in G. This contradicts the fact that v is adjacent to everything in G. O 


Exercises 


1. Find the radius, diameter and center of the graph shown in Figure 1.28. 


FIGURE 1.28. 


2. Find the radius and diameter of each of the following graphs: Pox, Po~+41, 
Cor, Cok+1; Kn, Kaine 


3. For each graph in Exercise 2, find the number of vertices in the center. 


4. If x is in the periphery of G and d(x, y) = ecc(x), then prove that y is in 
the periphery of G. 
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5. If u and v are adjacent vertices in a graph, prove that their eccentricities 
differ by at most one. 


6. A graph G is called self-centered if C(G) = V(G). Prove that every com- 
plete bipartite graph, every cycle, and every complete graph is self-centered. 


7. Given a connected graph G and a positive integer hk, the kth power of G, 
denoted G*, is the graph with V(G*) = V(G) and where vertices u and v 
are adjacent in G* if and only if dg(u, v) < k. 


(a) Draw the 2nd and 3rd powers of Pg and Co. 
(b) For a graph G of order n, what is G¢i2™(@) 7 


8. (a) Find a graph of order 7 that has radius 3 and diameter 6. 
(b) Find a graph of order 7 that has radius 3 and diameter 5. 
(c) Find a graph of order 7 that has radius 3 and diameter 4. 


(d) Suppose r and d are positive integers and r < d < 2r. Describe a 
graph that has radius r and diameter d. 


9. Suppose that u and v are vertices in a graph G, ecc(u) = m, ecc(v) = n, 
and m <n. Prove that d(u,v) > n—m. Then draw a graph G'; where 
d(u,v) = n—™m, and another graph Gz where d(u,v) > n — m. In each 
case, label the vertices u and v, and give the values of m and n. 


10. Let G be a connected graph with at least one cycle. Prove that G' has at least 
one cycle whose length is less than or equal to 2 diam(G) + 1. 


11. (a) Prove that if G is connected and diam(G) > 3, then G is connected. 


(b) Prove that if diam(G) > 3, then diam(G) < 3. 
(c) Prove that if G is regular and diam(G) = 3, then diam(G) = 2. 


1.2.2. Graphs and Matrices 


Unfortunately no one can be told what the Matrix is. You have to see 
it for yourself. 
— Morpheus, The Matrix 


What do matrices have to do with graphs? This is a natural question — nothing 
we have seen so far has suggested any possible relationship between these two 
types of mathematical objects. That is about to change! 

As we have seen, a graph is a very visual object. To this point, we have deter- 
mined distances by looking at the diagram, pointing with our fingers, and count- 
ing edges. This sort of analysis works fairly well for small graphs, but it quickly 
breaks down as the graphs of interest get larger. Analysis of large graphs often 
requires computer assistance. 
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Computers cannot just look and point at graphs like we can. Instead, they under- 
stand graphs via matrix representations. One such representation is an adjacency 
matrix. Let G be a graph with vertices v1, v2, ..., Un. The adjacency matrix of G 
is the n x n matrix A whose (i, j) entry, denoted by [.A];,;, is defined by 


[Alij = 1 if; and v; are adjacent, 
"J “" \) 0 otherwise. 


The graph in Figure 1.29 has six vertices. Its adjacency matrix, A, is 


OrRrOC Oo 
oo oc — Oo 
HH oo &S — © 
eFPrFeoCoorF 
For OOF 
OrRrRrFROCO 


FIGURE 1.29. 


Note that for simple graphs (where there are no loops) adjacency matrices have 
all zeros on the main diagonal. You can also see from the definition that these 
matrices are symmetric.! 

A single graph can have multiple adjacency matrices — different orderings of 
the vertices will produce different matrices. If you think that these matrices ought 
to be related in some way, then you are correct! In fact, if A and B are two differ- 
ent adjacency matrices of the same graph G, then there must exist a permutation 
of the vertices such that when the permuation is applied to the corresponding rows 
and columns of A, you get B. 

This fact can be used in reverse to determine if two graphs are isomorphic, 
and the permutation mentioned here serves as an appropriate bijection: Given two 
graphs G and G2 with respective adjacency matrices A and Ag, if one can apply 


'Can you think of a context in which adjacency matrices might not be symmetric? Direct your 
attention to Figure 1.3 for a hint. 
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a permutation to the rows and columns of A; and produce Ag, then G, and G2 
are isomorphic. 

Let’s take a closer look at the previous example. The fact that the (1,6) entry 
is 0 indicates that v; and vg are not adjacent. Consider now the (1,6) entry of the 
matrix A?. This entry is just the dot product of row one of A with column six of 
A: 

[A7],.¢ = (0,0,0, 1,1, 0) - (0,0, 1, 1,1, 0) 
= (0-0) + (0-0) + (0-1)+ (1-1) + (1-1) 4+ (0-0) 
= 2. 


Think about what makes this dot product nonzero. It is the fact that there was 
at least one place (and here there were two places) where a | in row one corre- 
sponded with a | in column six. In our case, the | in the fourth position of row 
one (representing the edge v,v4) matched up with the | in the fourth position of 
column six (representing the edge v4v¢). The same thing occurred in the fifth po- 
sition of the row and column (where the edges represented were v1 v5 and Us v6). 

Can you see what is happening here? The entry in position (1,6) of A? is equal 
to the number of two-edge walks from v; to vg in G. As the next theorem shows 
us, this is not a coincidence. 


Theorem 1.7. Let G be a graph with vertices labeled v1, v2, ..., Un, and let A 
be its corresponding adjacency matrix. For any positive integer k, the (i,j) entry 
of A® is equal to the number of walks from v; to v; that use exactly k edges. 


Proof. We prove this by induction on k. For k = 1, the result is true since [A];,; = 
1 exactly when there is a one-edge walk between v; and v;. 

Now suppose that for every i and j, the (i, 7) entry of A*~! is the number of 
walks from 1; to v; that use exactly k — 1 edges. For each k-edge walk from v; to 
v;, there exists an h such that the walk can be thought of as a (k — 1)-edge walk 
from v; to vp, combined with an edge from v;, to v;. The total number of these 
k-edge walks, then, is 


S- (number of (& — 1)-edge walks from 1; to vy). 
vnEN (vj) 


By the induction hypothesis, we can rewrite this sum as 


n 


S- [Ae ae =) a NinlA Jang = [A* lig , 


vn EN (v;) h=1 
and this proves the result. O 


This theorem has a straightforward corollary regarding distance between ver- 
tices. 


Corollary 1.8. Let G be a graph with vertices labeled v1, v2, ..., Un, and let 
A be its corresponding adjacency matrix. If d(v;,v;) = x, then [A*];,; = 0 for 
1<k< x 
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Let’s see if we can relate these matrices back to earlier distance concepts. Given 
a graph G of order n with adjacency matrix A, and given a positive integer k, 
define the matrix sum S; to be 


Sp=I+A+A?+---+A'*, 


where J is the n x n identity matrix. Since the entries of J and A are ones and 
zeros, the entries of S;, (for any ‘:) are nonnegative integers. This implies that for 
every pair i, j, we have [Sx]i,7 < [Sk+ili,j- 


Theorem 1.9. Let G be a connected graph with vertices labeled v,, v2, ..., Un 
and let A be its corresponding adjacency matrix. 


1. If k is the smallest positive integer such that row j of Si, contains no zeros, 
then ecc(v;) = k. 


2. If r is the smallest positive integer such that all entries of at least one row 
of 5; are positive, then rad(G) = r. 


3. If mis the smallest positive integer such that all entries of Sym, are positive, 
then diam(G) = m. 


Proof. We will prove the first part of the theorem. The proofs of the other parts 
are left for you as exercises.” 

Suppose that k is the smallest positive integer such that row 7 of 5S; contains 
no zeros. The fact that there are no zeros on row j of 5), implies that the distance 
from v; to any other vertex is at most k. If k = 1, the result follows immediately. 
For k > 1, the fact that there is at least one zero on row j of S;_ 1 indicates that 
there is at least one vertex whose distance from v; is greater than k — 1. This 
implies that ecc(v;) = k. O 


We can use adjacency matrices to create other types of graph-related matrices. 
The steps given below describe the construction of a new matrix, using the matrix 
sums 5}, defined earlier. Carefully read through the process, and (before you read 
the paragraph that follows!) see if you can recognize the matrix that is produced. 


Creating a New Matrix, (/ 


Given: A connected graph of order n, with adjacency matrix A, and with Sj, as 
defined earlier. 


1. Foreachi € {1,2,...,n}, let [M];,; =0. 


2. For each pair 7, 7 where i # J, let [/];,; = k where k is the least positive 
integer such that [S;];,; 4 0. 


2You’re welcome. 
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Can you see what the entries of M will be? For each pair i, j, the (2, 7) entry 
of M is the distance from v; to v;. That is, 


[M];,j = d(v;, v;). 
The matrix M/ is called the distance matrix of the graph G. 
Exercises 
1. Give the adjacency matrix for each of the following graphs. 


(a) Po, and P3,41, where the vertices are labeled from one end of the 
path to the other. 


(b) Co, and C2441, where the vertices are labeled consecutively around 
the cycle. 


(c) Km,n, where the vertices in the first partite set are labeled 11,..., Um. 
(d) K,,, where the vertices are labeled any way you please. 


2. Without computing the matrix directly, find A? where A is the adjacency 
matrix of K4. 


3. If A is the adjacency matrix for the graph G, show that the (j, 7) entry of 
A? is the degree of v;. 


4. Let A be the adjacency matrix for the graph G. 


(a) Show that the number of triangles that contain v, is $[A*];,;. 


(b) The trace of a square matrix /, denoted Tr(J/), is the sum of the 
entries on the main diagonal. Prove that the number of triangles in G 
is ¢ Tr(A). 


5. Find the (1,5) entry of A?°°? where A is the adjacency matrix of C19 and 
where the vertices of C19 are labeled consecutively around the cycle. 
6. (a) Prove the second statement in Theorem 1.9. 
(b) Prove the third statement in Theorem 1.9. 


7. Use Theorem 1.9 to design an algorithm for determining the center of a 
graph G. 


8. The graph G has adjacency matrix A and distance matrix D. Prove that if 
A = D, then G is complete. 


9. Give the distance matrices for the graphs in Exercise 1. You should create 
these matrices directly — it is not necessary to use the method described in 
the section. 
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1.2.3 Graph Models and Distance 


Do I know you? 
— Kevin Bacon, in Flatliners 


We have already seen that graphs can serve as models for all sorts of situations. 
In this section we will discuss several models in which the idea of distance is 
significant. 


The Acquaintance Graph 


“Wow, what a small world!” This familiar expression often follows the discovery 
of a shared acquaintance between two people. Such discoveries are enjoyable, 
for sure, but perhaps the frequency with which they occur ought to keep us from 
being as surprised as we typically are when we experience them. 

We can get a better feel for this phenomenon by using a graph as a model. 
Define the Acquaintance Graph, A, to be the graph where each vertex represents 
a person, and an edge connects two vertices if the corresponding people know 
each other. The context here is flexible — one could create this graph for the 
people living in a certain neighborhood, or the people working in a certain office 
building, or the people populating a country or the planet. Since the smaller graphs 
are all subgraphs of the graphs for larger populations, most people think of A in 
the largest sense: The vertices represent the Earth’s human population.? 

An interesting question is whether or not the graph A, in the large (Earth) sense, 
is connected. Might there be a person or a group of people with no connection 
(direct or indirect) at all to another group of people?* While there is a possibility 
of this being the case, it is most certainly true that if A is in fact disconnected, 
there is one very large connected component. 

The graph A can be illuminating with regard to the “six degrees of separation” 
phenomenon. Made popular (at least in part) by a 1967 experiment by social psy- 
chologist Stanley Milgram [204] and a 1990 play by John Guare [142], the “six 
degrees theory” asserts that given any pair of people, there is a chain of no more 
than six acquaintance connections joining them. Translating into graph theorese, 
the assertion is that diam(A) < 6. It is, of course, difficult (if not impossible) to 
confirm this. For one, A is enormous, and the kind of computation required for 
confirmation is nontrivial (to say the least!) for matrices with six billion rows. Fur- 
ther, the matrix A is not static — vertices and edges appear all of the time.> Still, 
the graph model gives us a good way to visualize this intriguing phenomenon. 

Milgram’s experiment [204] was an interesting one. He randomly selected sev- 
eral hundred people from certain communities in the United States and sent a 


3The graph could be made even larger by allowing the vertices to represent all people, living or 
dead. We will stick with the living people only — six billion vertices is large enough, don’t you think? 

4Wouldn’t it be interesting to meet such a person? Wait — it wouldn’t be interesting for long 
because as soon as you meet him, he is no longer disconnected! 

5Vertices will disappear if you limit A to living people. Edges disappear when amnesia strikes. 
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packet to each. Inside each packet was the name and address of a single “target” 
person. If the recipient knew this target personally, the recipient was to mail the 
packet directly to him. If the recipient did not know the target personally, the re- 
cipient was to send the packet to the person he/she thought had the best chance 
of knowing the target personally (perhaps someone in the same state as the target, 
or something like that). The new recipient was to follow the same rules: Either 
send it directly to the target (if known personally) or send it to someone who has 
a good chance of knowing the target. Milgram tracked how many steps it took for 
the packets to reach the target. Of the packets that eventually returned, the median 
number of steps was 5! Wow, what a small world! 


The Hollywood Graph 


Is the actor Kevin Bacon the center of Hollywood? This question, first asked by a 
group of college students in 1993, was the beginning of what was soon to become 
a national craze: The Kevin Bacon Game. The object of the game is to connect 
actors to Bacon through appearances in movies. For example, the actress Emma 
Thompson can be linked to Bacon in two steps: Thompson costarred with Gary 
Oldman in Harry Potter and the Prisoner of Azkaban (among others), and Oldman 
appeared with Bacon in JFK. Since Thompson has not appeared with Bacon in 
a movie, two steps is the best we can do. We say that Thompson has a Bacon 
number of 2. 

Can you sense the underlying graph here?° Let us define the Hollywood Graph, 
H, as follows: The vertices of H represent actors, and an edge exists between two 
vertices when the corresponding actors have appeared in a movie together. So, in 
f, Oldman is adjacent to both Bacon and Thompson, but Bacon and Thompson 
are not adjacent. Thompson has a Bacon number of 2 because the distance from 
her vertex to Bacon’s is 2. In general, an actor’s Bacon number is defined to be 
the distance from that actor’s vertex to Bacon’s vertex in 7. If an actor cannot be 
linked to Bacon at all, then that actor’s Bacon number is infinity. As was the case 
with the Acquaintance Graph, if H is disconnected we can focus our attention on 
the single connected component that makes up most of H (Bacon’s component). 

The ease with which Kevin Bacon can be connected to other actors might lead 
one to conjecture that Bacon is the unique center of Hollywood. In terms of graph 
theory, that conjecture would be that the center of H consists only of Bacon’s ver- 
tex. Is this true? Is Bacon’s vertex even in the center at all? Like the Acquaintance 
Graph, the nature of H changes frequently, and answers to questions like these 
are elusive. The best we can do is to look at a snapshot of the graph and answer 
the questions based on that particular point in time. 

Let’s take a look at the graph as it appeared on December 25, 2007. On that 
day, the Internet Movie Database [165] had records for nearly 1.3 million actors. 
Patrick Reynolds maintains a website [234] that tracks Bacon numbers, among 
other things. According to Reynolds, of the 1.3 million actors in the database on 


Sor, “Can you smell the Bacon?” 
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that day, 917,007 could be linked to Bacon in some way via chains of shared 
movie appearances. The maximum distance from Bacon to any of the actors in 
his component was 8 (and so Bacon’s eccentricity is 8). What about eccentricities 
of other actors? Are there any that are less than 8? According to Reynolds, the 
answer is no — 8 is the smallest eccentricity, and so Kevin Bacon is in the center 
of H. But it is very crowded there — thousands and thousands of other actors 
have eccentricity 8 as well. 


The Mathematical Collaboration Graph 


The Hungarian Paul Erdés (1913-1996) was one of the greatest and most pro- 
lific mathematicians of the twentieth century. Erdés authored or coauthored over 
1500 mathematical papers covering topics in graph theory, combinatorics, set the- 
ory, geometry, number theory, and more. He collaborated with hundreds of other 
mathematicians, and this collaboration forms the basis of a Bacon-like ranking 
system. While not as widely popular as Bacon numbers, almost all mathemati- 
cians are familiar with the concept of Erdés numbers. 

Erdés himself is assigned Erdés number 0. Any mathematician who coauthored 
a paper with Erdés has Erdés number |. If a person has coauthored a paper with 
someone who has an Erdés number of | (and if that person himself/herself doesn’t 
have Erd6s number 1), then that person has an Erdés number of 2. Higher Erdés 
numbers are assigned in a similar manner. 

The underlying graph here should be clear. Define the Mathematical Collabo- 
ration Graph, C’, to have vertices corresponding to researchers, and let an edge 
join two researchers if the two have coauthored a paper together. A researcher’s 
Erdés number, then, is the distance from the corresponding vertex to the vertex 
of Erdés. If a researcher is not in the same connected component of C’ as Erdés, 
then that researcher has an infinite Erd6s number. 

As you might imagine, new vertices and edges are frequently added to C’. Jerry 
Grossman maintains a website [140] that keeps track of Erd6s numbers. At one 
point in 2007, there were over 500 researchers with Erdés number | and over 
8100 with Erdés number 2. You might surmise that because Erdés died in 1996, 
the number of people with Erdés number | has stopped increasing. While this is 
surely to be true sometime in the near future, it hasn’t happened yet. A number of 
papers coauthored by Erdés have been published since his death. Erdés has not 
been communicating with collaborators from the great beyond (at least as far as 
we know) — it is simply the case that his collaborators continue to publish joint 
research that began years ago. 


Small World Networks 


AS we saw earlier, the Acquaintance Graph provides a way to model the famous 
“small world phemomenon” — the sense that humans are connected via numerous 
recognized and unrecognized connections. The immense size and dynamic nature 
of that graph make it difficult to analyze carefully and completely, and so smaller 
models can prove to be more useful. In order for the more manageable graphs to 
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be helpful, though, it is important that they enjoy some fundamental small world 
properties. 

So what makes a small world small? What properties should a graph have if 
it is to be a model of a small world? Let’s list a few. As you read through the 
list below, think about your own acquaintance network and see if these properties 
make sense to you. 


1. There should be plenty of mutual acquaintances (shared neighbors). If this 
were the only property, then complete graphs would surely fit the bill — 
lots of mutual neighbors there. A complete graph, though, is not a realistic 
model of acquaintances in the world. 


2. The graph should be sparse in edges. In a realistic model, there should be 
relatively few edges compared to the number of vertices in the graph. 


3. Distances between pairs of vertices should be relatively small. The char- 
acteristic path length of a graph G, denoted Lg, is the average distance 
between vertices, where the average is taken over all pairs of distinct ver- 
tices. In any graph of order n, there are |E(4,,)| distinct pairs of vertices, 
and in Exercise | of Section 1.1.3, you showed that |E(A,)| = n(n—1)/2. 


So for a graph G of order n, 
ar vEeV(G) d(u, v) 2 
Le = oOo DT FE omer d(u, v). 
AK) nn —1) x 


One way of obtaining this value for a graph is to find the mean of the non- 
diagonal entries in the distance matrix of the graph. 


4. There should be a reasonable amount of clustering in a small world graph. 
In actual acquaintance networks, there are a number of factors (geography, 
for instance) that create little clusters of vertices — small groups of vertices 
among which a larger than typical portion of edges exists. For example, 
there are likely to be many edges among the vertices that represent the 
people that live in your neighborhood. 


Given a vertex v in a graph of order n, we define its clustering coefficient, 
denoted cc(v), as follows (recall that (V[v]) is the subgraph induced by the 
closed neighborhood of v). 
») — LE UN) IE (CN ToI))I 
cc(v) = = 

|Z (Eis dente) ) | (1 + deg(v)) deg(v) 
For each vertex v, this is the percentage of edges that exist among the ver- 
tices in the closed neighborhood of v. For a graph G of order n, we define 
the clustering coefficient of the graph G’, denoted by CC(G) to be the aver- 
age of the clustering coefficients of the vertices of G’. That is, 


co(a) = = Sy ce(w), 


veEV(G) 
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Small world networks have the property that characteristic path lengths are low 
and clustering coefficients are high. Graphs that have these properties can be used 
as models in the mathematical analyses of the small world phenomenon and its 
associated concepts. It is interesting to note that other well known networks have 
exhibited small world traits — the internet, electric power grids, and even neural 
networks are examples — and this increases even further the applicability of graph 
models. 


Exercises 


1. Compute the characteristic path length for each of each of the following 
graphs: Po,,, Por+1, Cor, Corti, Kn, Kmn- 


2. Compute the clustering coefficient for each of each of the following graphs: 
Pr, Pop+i, Cor, Cor+41; Kn, Kmn- 


3. (a) In the Acquaintance Graph, try to find a path from your vertex to the 
vertex of the President of the United States. 


(b) Your path from the previous question may not be your shortest such 
path. Prove that your actual distance from the President is at most 
one away from the shortest such distance to be found among your 
classmates. 


Interesting Note: There are several contexts in which Bacon numbers can be cal- 
culuated. While Bacon purists only use movie connections, others include shared 
appearances on television and in documentaries as well. Under these more open 
guidelines, the mathematician Paul Erdés actually has a Bacon number of 3! Erdés 
was the focus of the 1993 documentary N is a Number [63]. British actor Alec 
Guinness made a (very) brief appearance near the beginning of that film, and 
Guinness has a Bacon number of 2 (can you find the connections?). As far as 
we know, Bacon has not coauthored a research article with anyone who is con- 


nected to Erdés, and so while Erdés’ Bacon number is 3, Bacon’s Erdés number 
is infinity. 


1.3 Trees 


“O look at the trees!” they cried, “O look at the trees!” 
— Robert Bridges, London Snow 


In this section we will look at the trees—but not the ones that sway in the wind 
or catch the falling snow. We will talk about graph-theoretic trees. Before moving 
on, glance ahead at Figure 1.30, and try to pick out which graphs are trees. 
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1.3.1 Definitions and Examples 


Example, the surest method of instruction. 
— Pliny the Younger 


In Figure 1.30 graphs A, B, and F are trees, while graphs C and D are not. 


A B Cc 


FIGURE 1.30. Which ones are trees? 


A tree is a connected graph that contains no cycles. Graph-theoretic trees re- 
semble the trees we see outside our windows. For example, graph-theoretic trees 
do not have cycles, just as the branches of trees in nature do not split and rejoin. 
The descriptive terminology does not stop here. 

Graph D in Figure 1.30 is not a tree; rather, it is a forest. A forest is a collection 
of one or more trees. A vertex of degree | in a tree is called a leaf. 

As in nature, graph-theoretic trees come in many shapes and sizes. They can 
be thin (Pio) or thick (4611900), tall (Piooo) or short (4%; and v2). Yes, even the 
graphs /, and K are considered trees (they are certainly connected and acyclic). 
In the spirit of our arboreal terminology, perhaps we should call Ky a stump and 
Koa twig ! 

While we are on the subject of small trees, we should count a few of them. It is 
clear that A‘, and F2 are the only trees of order 1 and 2, respectively. A moment’s 
thought will reveal that P3 is the only tree of order 3. Figure 1.31 shows the 
different trees of order 6 or less. 

Trees sprout up as effective models in a wide variety of applications. We men- 
tion a few brief examples. 


Examples 


1. Trees are useful for modeling the possible outcomes of an experiment. For 
example, consider an experiment in which a coin is flipped and a 6-sided die 
is rolled. The leaves in the tree in Figure 1.32 correspond to the outcomes 
in the probability space for this experiment. 
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FIGURE 1.31. Trees of order 6 or less. 


VON 
1 2 3 4 5 6 1 2 3 4 5 6 


FIGURE 1.32. Outcomes of a coin/die experiment. 


2. Programmers often use tree structures to facilitate searches and sorts and 
to model the logic of algorithms. For instance, the logic for a program that 
finds the maximum of four numbers (w, x, y, z) can be represented by the 
tree shown in Figure 1.33. This type of tree is a binary decision tree. 


3. Chemists can use trees to represent, among other things, saturated hydro- 
carbons—chemical compounds of the form C,,H2n+2 (propane, for exam- 
ple). The bonds between the carbon and hydrogen atoms are depicted in the 
trees of Figure 1.34. The vertices of degree 4 are the carbon atoms, and the 
leaves represent the hydrogen atoms. 


4. College basketball fans will recognize the tree in Figure 1.35. It displays 
final results for the “Sweet 16” portion of the 2008 NCAA men’s basketball 
tournament. Each vertex represents a single game. 
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FIGURE 1.33. Logic of a program. 


ethane propane pentane 


FIGURE 1.34. A few saturated hydrocarbons. 
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FIGURE 1.35. The 2008 Men’s Sweet 16. 


Exercises 


1. Draw all unlabeled trees of order 7. Hint: There are a prime number of 


them. 
2. Draw all unlabeled forests of order 6. 


3. Let T be a tree of order n > 2. Prove that T’ is bipartite. 


34 1. Graph Theory 


4. Graphs of the form Jc, ,,, are called stars. Prove that if K;., is a tree, then it 
must be a star. 


5. Match the graphs in Figure 1.36 with appropriate names: a palm tree, au- 
tumn, a path through a forest, tea leaves. 


4S\¢ B 
Ye 


FIGURE 1.36. What would you name these graphs? 


1.3.2 Properties of Trees 


And the tree was happy. 
— Shel Silverstein, The Giving Tree 


Let us try an experiment. On a piece of scratch paper, draw a tree of order 16. 
Got one? Now count the number of edges in the tree. We are going to go out on 
a limb here and predict that there are 15. Since there are nearly 20,000 different 
trees of order 16, it may seem surprising that our prediction was correct. The next 
theorem gives away our secret. 


Theorem 1.10. [fT is a tree of order n, then T has n — 1 edges. 


Proof. We induct on the order of T’. For n = 1 the only tree is the stump (/,), 
and it of course has 0 edges. Assume that the result is true for all trees of order 
less than k, and let T be a tree of order k. 

Choose some edge of T' and call it e. Since T' is a tree, it must be that JT’ — e 
is disconnected (see Exercise 7) with two connected components that are trees 
themselves (see Figure 1.37). Say that these two components of JT’ — e are T} 
and 75, with orders k, and kz, respectively. Thus, /; and kz are less than n and 
ky tke =k. 

Since k, < k, the theorem is true for T;. Thus JT) has k, — 1 edges. Similarly, 
T> has kz — 1 edges. Now, since E(T) is the disjoint union of E(T,), E(T>2), and 
{e}, we have |E(T)| = (ky — 1) + (kp —1) +1 =k +kp —1=k -—1. This 
completes the induction. O 
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FIGURE 1.37. 


The next theorem extends the preceding result to forests. The proof is similar 
and appears as Exercise 4. 


Theorem 1.11. Jf F is a forest of order n containing k connected components, 
then F contains n — k edges. 


The next two theorems give alternative methods for defining trees. Two other 
methods are given in Exercises 5 and 6. 


Theorem 1.12. A graph of order n is a tree if and only if it is connected and 
contains n — 1 edges. 


Proof. The forward direction of this theorem is immediate from the definition of 
trees and Theorem 1.10. For the reverse direction, suppose a graph G of order 
n is connected and contains n — 1 edges. We need to show that G is acyclic. If 
G did have a cycle, we could remove an edge from the cycle and the resulting 
graph would still be connected. In fact, we could keep removing edges (one at 
a time) from existing cycles, each time maintaining connectivity. The resulting 
graph would be connected and acyclic and would thus be a tree. But this tree 
would have fewer than n — 1 edges, and this is impossible by Theorem 1.10. 
Therefore, G has no cycles, so G is a tree. Oo 


Theorem 1.13. A graph of order n is a tree if and only if it is acyclic and contains 
n — | edges. 


Proof. Again the forward direction of this theorem follows from the definition of 
trees and from Theorem 1.10. So suppose that G is acyclic and has n — 1 edges. 
To show that G is a tree we need to show only that it is connected. Let us say that 
the connected components of G are Gy, G2,..., Gx. Since G is acyclic, each of 
these components is a tree, and so G is a forest. Theorem 1.11 tells us that G has 
n — k edges, implying that / = 1. Thus G has only one connected component, 
implying that G is a tree. O 


It is not uncommon to look out a window and see leafless trees. In graph theory, 
though, leafless trees are rare indeed. In fact, the stump (J) is the only such tree, 
and every other tree has at least two leaves. Take note of the proof technique of 
the following theorem. It is a standard graph theory induction argument. 


Theorem 1.14. Let T be the tree of order n > 2. Then T has at least two leaves. 


Proof. Again we induct on the order. The result is certainly true if nm = 2, since 
T = Kg in this case. Suppose the result is true for all orders from 2 to n — 1, and 
consider a tree T' of order n > 3. We know that T has n — 1 edges, and since 
we can assume n > 3, T' has at least 2 edges. If every edge of T is incident with 
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a leaf, then T’ has at least two leaves, and the proof is complete. So assume that 
there is some edge of T' that is not incident with a leaf, and let us say that this edge 
is e = uv. The graph T' — e is a pair of trees, J, and To, each of order less than n. 
Let us say, without loss of generality, that u € V(T1), v € V(T2), |V(T1)| = 41, 
and |V(Z2)| = nz (see Figure 1.38). Since e is not incident with any leaves of T, 


FIGURE 1.38. 


we know that m1 and nz are both at least 2, so the induction hypothesis applies to 
each of 7; and 7. Thus, each of 7; and T> has two leaves. This means that each 
of T; and T> has at least one leaf that is not incident with the edge e. Thus the 
graph (T'— e) + e = T has at least two leaves. O 


We saw in the previous section that the center of a graph is the set of vertices 
with minimum eccentricity. The next theorem, due to Jordan [170], shows that for 
trees, there are only two possibilities for the center. 


Theorem 1.15. Jn any tree, the center is either a single vertex or a pair of adja- 
cent vertices. 


Proof. Given a tree T’, we form a sequence of trees as follows. Let Tp = T’. Let 
T, be the graph obtained from To by deleting all of its leaves. Note here that T) 
is also a tree. Let T> be the tree obtained from T{ by deleting all of the leaves of 
T. In general, for as long as it is possible, let 7; be the tree obtained by deleting 
all of the leaves of T;_;. Since T’ is finite, there must be an integer r such that T;, 
is either Ky or Ko. 

Consider now a consecutive pair T;, 741 of trees from the sequence T’ = To, 
T\,..., I. Let v be a non-leaf of T;. In T;, the vertices that are at the greatest 
distance from v are leaves (of T;,). This means that the eccentricity of v in T;+1 is 
one less than the eccentricity of v in T;. Since this is true for all non-leaves of T;, 
it must be that the center of T+, is exactly the same as the center of 7}. 

Therefore, the center of J; is the center of T;._1, which is the center of T)._2, 
..., Which is the center of 7g = J’. Since (the center of) 7}. is either Ky or Ko, 
the proof is complete. O 


We conclude this section with an interesting result about trees as subgraphs. 


Theorem 1.16. Let T be a tree with k edges. If G is a graph whose minimum 
degree satisfies 6(G) > k, then G contains T as a subgraph. Alternatively, G 
contains every tree of order at most 6(G) + 1 as a subgraph. 
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Proof. We induct on k. If k = 0, then T = Kj, and it is clear that Ky is a 
subgraph of any graph. Further, if & = 1, then T = Ko, and Ko is a subgraph of 
any graph whose minimum degree is |. Assume that the result is true for all trees 
with k — 1 edges (k > 2), and consider a tree T with exactly k edges. We know 
from Theorem 1.14 that J’ contains at least two leaves. Let v be one of them, and 
let w be the vertex that is adjacent to v. Consider the graph T’ — v. Since T' — v 


T-v 
FIGURE 1.39. 


has k — 1 edges, the induction hypothesis applies, so T’ — v is a subgraph of G. 
We can think of 7’ — v as actually sitting inside of G (meaning w is a vertex of G, 
too). Now, since G contains at least k + 1 vertices and T’— v contains k vertices, 
there exist vertices of G that are not a part of the subgraph T’ — v. Further, since 
the degree in G' of w is at least k, there must be a vertex u not in J’ — v that is 
adjacent to w (Figure 1.40). The subgraph T’— v together with wu forms the tree T’ 


FIGURE 1.40. A copy of T inside G. 
as a subgraph of G. oO 


Exercises 
1. Draw each of the following, if you can. If you cannot, explain the reason. 
(a) A 10-vertex forest with exactly 12 edges 
(b) A 12-vertex forest with exactly 10 edges 
(c) A 14-vertex forest with exactly 14 edges 
(d) A 14-vertex forest with exactly 13 edges 
(e) A 14-vertex forest with exactly 12 edges 


2. Suppose a tree T has an even number of edges. Show that at least one vertex 
must have even degree. 


3. Let T be a tree with max degree A. Prove that T’ has at least A leaves. 
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4. Let F be a forest of order n containing k connected components. Prove that 
F contains n — k edges. 


5. Prove that a graph G is a tree if and only if for every pair of vertices u, v, 
there is exactly one path from uw to v. 


6. Prove that T’ is a tree if and only if 7’ contains no cycles, and for any new 
edge e, the graph T' + e has exactly one cycle. 


7. Show that every edge in a tree is a bridge. 
8. Show that every nonleaf in a tree is a cut vertex. 


9. Find a shorter proof to Theorem 1.14. Hint: Start by considering a longest 
path in 7. 


10. Let T' be a tree of order n > 1. Show that the number of leaves is 


2+ > (deg(v:) —2), 


deg(vi)23 
where the sum is over all vertices of degree 3 or more. 


11. Fora graph G, define the average degree of G' to be 


dvev(a) deg(v) 
avgdeg(G) = Svev(G) Ph 
|V(G)| 
If T is a tree and avgdeg(T) = a, then find an expression for the number 
of vertices in T’ in terms of a. 


12. Let T be a tree such that every vertex adjacent to a leaf has degree at least 
3. Prove that some pair of leaves in T’ has a common neighbor. 


1.3.3 Spanning Trees 


Under the spreading chestnut tree... 
— Henry W. Longfellow, The Village Blacksmith 


The North Carolina Department of Transportation (NCDOT) has decided to im- 
plement a rapid rail system to serve eight cities in the western part of the state. 
Some of the cities are currently joined by roads or highways, and the state plans 
to lay the track right along these roads. Due to the mountainous terrain, some of 
the roads are steep and curvy; and so laying track along these roads would be 
difficult and expensive. The NCDOT hired a consultant to study the roads and to 
assign difficulty ratings to each one. The rating accounted for length, grade, and 
curviness of the roads; and higher ratings correspond to greater cost. The graph 
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Jefferson 


Banner Elk 


Linville 


Marion 


FIGURE 1.41. The city graph. 


in Figure 1.41, call it the “city graph,” shows the result of the consultant’s inves- 
tigation. The number on each edge represents the difficulty rating assigned to the 
existing road. 

The state wants to be able to make each city accessible (but not necessarily 
directly accessible) from every other city. One obvious way to do this is to lay 
track along every one of the existing roads. But the state wants to minimize cost, 
so this solution is certainly not the best, since it would result in a large amount 
of unnecessary track. In fact, the best solution will not include a cycle of track 
anywhere, since a cycle would mean at least one segment of wasted track. 

The situation above motivates a definition. Given a graph G' and a subgraph T, 
we say that T is a spanning tree of G if T is a tree that contains every vertex of 
G. 

So it looks as though the DOT just needs to find a spanning tree of the city 
graph, and they would like to find one whose overall rating is as small as possible. 
Figure 1.42 shows several attempts at a solution. 

Of the solutions in the figure, the one in the upper right has the least total 
weight—but is it the best one overall? Try to find a better one. We will come back 
to this problem soon. 

Given a graph G, a weight function is a function W that maps the edges of 
G to the nonnegative real numbers. The graph G together with a weight func- 
tion is called a weighted graph . The graph in Figure 1.41 is a simple example of 
a weighted graph. Although one might encounter situations where negative val- 
ued weights would be appropriate, we will stick with nonnegative weights in our 
discussion. 

It should be fairly clear that every connected graph has at least one spanning 
tree. In fact, it is not uncommon for a graph to have many different spanning trees. 
Figure 1.42 displays three different spanning trees of the city graph. 

Given a connected, weighted graph G,, a spanning tree T is called a minimum 
weight spanning tree if the sum of the weights of the edges of T’ is no more than 
the sum for any other spanning tree of G. 
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Total Weight = 325 Total Weight = 240 


10 
Total Weight = 290 


FIGURE 1.42. Several spanning trees. 


There are a number of fairly simple algorithms for finding minimum weight 
spanning trees. Perhaps the best known is Kruskal’s algorithm. 


Kruskal’s Algorithm 


Given: A connected, weighted graph G. 
i. Find an edge of minimum weight and mark it. 


ii. Among all of the unmarked edges that do not form a cycle with any of the 
marked edges, choose an edge of minimum weight and mark it. 


ili. If the set of marked edges forms a spanning tree of G, then stop. If not, 
repeat step ii. 


Figure 1.43 demonstrates Kruskal’s algorithm applied to the city graph. The 
minimum weight is 210. 

It is certainly possible for different trees to result from two different appli- 
cations of Kruskal’s algorithm. For instance, in the second step we could have 
chosen the edge between Marion and Lenoir instead of the one that was chosen. 
Even so, the total weight of resulting trees is the same, and each such tree is a 
minimum weight spanning tree. 
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FIGURE 1.43. The stages of Kruskal’s algorithm. 


It should be clear from the algorithm itself that the subgraph built is in fact 
a spanning tree of G. How can we be sure, though, that it has minimum total 
weight? The following theorem answers our question [183]. 


Theorem 1.17. Kruskal’s algorithm produces a spanning tree of minimum total 
weight. 


Proof. Let G be a connected, weighted graph of order n, and let T’ be a spanning 
tree obtained by applying Kruskal’s algorithm to G. As we have seen, Kruskal’s 
algorithm builds spanning trees by adding one edge at a time until a tree is formed. 
Let us say that the edges added for T’ were (in order) €1, €2,..., €n—1. Suppose 
T is not a minimum weight spanning tree. Among all minimum weight spanning 
trees of G, choose T” to be a minimum weight spanning tree that agrees with 
the construction of T' for the longest time (i.e., for the most initial steps). This 
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means that there exists some k such that T’ contains e),...,e,, and no minimum 
weight spanning tree contains all of e1,..., ex, €x41 (notice that since T is not of 
minimum weight, k <n — 1). 

Since T’ is a spanning tree, it must be that T’ + e,41 contains a cycle C, and 
since T contains no cycles, C' must contain some edge, say e’, that is not in T. 
If we remove the edge e’ from T’ + e,41, then the cycle C' is broken and what 
remains is a spanning tree of G. Thus, T’ + e,+1 — e’ is a spanning tree of G, and 
it contains edges €1,..., €x, €x+1- Furthermore, since the edge e’ must have been 
available to be chosen when e,41 was chosen by the algorithm, it must be that 
w(en41) < w(e’). This means that T’ + e,41 — e’ is a spanning tree with weight 


no more than T” that contains edges e1,..., €x41, contradicting our assumptions. 
Therefore, it must be that T is a minimum weight spanning tree. O 
Exercises 


1. Prove that every connected graph contains at least one spanning tree. 


2. Prove that a graph is a tree if and only if it is connected and has exactly one 
spanning tree. 


3. Let G be a connected graph with n vertices and at least n edges. Let C' be a 
cycle of G. Prove that if T is a spanning tree of G, then T, the complement 
of T,, contains at least one edge of C. 


4. Let G be connected, and let e be an edge of G. Prove that e is a bridge if 
and only if it is in every spanning tree of G. 


5. Using Kruskal’s algorithm, find a minimum weight spanning tree of the 
graphs in Figure 1.44. In each case, determine (with proof) whether the 
minimum weight spanning tree is unique. 


8 11 
: ny aN 
7 12 
7 


FIGURE 1.44. Two weighted graphs. 
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6. Prim’s algorithm (from [228]) provides another method for finding mini- 
mum weight spanning trees. 


Prim’s Algorithm 


Given: A connected, weighted graph G. 


i. Choose a vertex v, and mark it. 


ii. From among all edges that have one marked end vertex and one un- 
marked end vertex, choose an edge e of minimum weight. Mark the 
edge e, and also mark its unmarked end vertex. 


iii. If every vertex of G' is marked, then the set of marked edges forms a 
minimum weight spanning tree. If not, repeat step ii. 


Use Prim’s algorithm to find minimum weight spanning trees for the graphs 
in Figure 1.44. As you work, compare the stages to those of Kruskal’s al- 
gorithm. 


7. Give an example of a connected, weighted graph G having (i) a cycle with 
two identical weights, which is neither the smallest nor the largest weight in 
the graph, and (ii) a unique minimum weight spanning tree which contains 
exactly one of these two identical weights. 


1.3.4 Counting Trees 


As for everything else, so for a mathematical theory: beauty can be 
perceived but not explained. 
— Arthur Cayley [214] 


In this section we discuss two beautiful results on counting the number of span- 
ning trees in a graph. The next chapter studies general techniques for counting 
arrangements of objects, so these results are a sneak preview. 


Cayley’s Tree Formula 


Cayley’s Tree Formula gives us a way to count the number of different labeled 
trees on n vertices. In this problem we think of the vertices as being fixed, and 
we consider all the ways to draw a tree on those fixed vertices. Figure 1.45 shows 
three different labeled trees on three vertices, and in fact, these are the only three. 


There are 16 different labeled trees on four vertices, and they are shown in 
Figure 1.46. 

As an exercise, the ambitious student should try drawing all of the labeled trees 
on five vertices. The cautious ambitious student might wish to look ahead at Cay- 
ley’s formula before embarking on such a task. 
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FIGURE 1.45. Labeled trees on three vertices. 
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FIGURE 1.46. Labeled trees on four vertices. 


Cayley proved the following theorem in 1889 [50]. The proof technique that we 
will describe here is due to Priifer’ [229]. Priifer’s method is almost as noteworthy 
as the result itself. He counted the labeled trees by placing them in one-to-one cor- 
respondence with a set whose size is easy to determine—the set of all sequences 
of length n — 2 whose entries come from the set {1,...,}. There are n”~? such 
sequences. 


Theorem 1.18 (Cayley’s Tree Formula). There are n”~? distinct labeled trees of 
order n. 


The algorithm below gives the steps that Priifer used to assign a particular se- 
quence to a given tree, 7’, whose vertices are labeled 1,...,7. Each labeled tree 
is assigned a unique sequence. 


7 With a name like that he was destined for mathematical greatness! 
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Priifer’s Method for Assigning a Sequence to a Labeled Tree 
Given: A tree 7’, with vertices labeled 1,...,n. 

1. Let i = 0, and let Tp = T. 

2. Find the leaf on 7; with the smallest label and call it v. 

3. Record in the sequence the label of v’s neighbor. 

4. Remove v from T; to create a new tree T;+1. 


5. If Tj41 = Ko, then stop. Otherwise, increment 7 by | and go back to step 
2. 


Let us run through this algorithm with a particular graph. In Figure 1.47, tree 
T = To has 7 vertices, labeled as shown. The first step is finding the leaf with 
smallest label: This would be 2. The neighbor of vertex 2 is the vertex labeled 
4. Therefore, 4 is the first entry in the sequence. Removing vertex 2 produces 
tree T;. The leaf with smallest label in T; is 4, and its neighbor is 3. Therefore, 
we put 3 in the sequence and delete 4 from 7). Vertex 5 is the smallest leaf in 
tree T2 = T, — {4}, and its neighbor is 1. So our sequence so far is 4, 3, 1. In 
T3 = T2—{5} the smallest leaf is vertex 6, whose neighbor is 3. In T, = T3—{6}, 
the smallest leaf is vertex 3, whose neighbor is 1. Since T; = Ko, we stop here. 
Our resulting sequence is 4, 3, 1, 3, 1. 

Notice that in the previous example, none of the leaves of the original tree 7’ 
appears in the sequence. More generally, each vertex v appears in the sequence 
exactly deg(v) — 1 times. This is not a coincidence (see Exercise 1). We now 
present Priifer’s algorithm for assigning trees to sequences. Each sequence gets 
assigned a unique tree. 


Priifer’s Method for Assigning a Labeled Tree to a Sequence 
Given: A sequence o = a1, 02,...,@ of entries from the set {1,...,& +2}. 
1. Draw k+2 vertices; label them 1, v2, ..., Ugp2. Let S = {1,2,...,k4+2}. 
2. Let i = 0, let o9 = o, and let Sp = S. 

3. Let 7 be the smallest number in S; that does not appear in the sequence o;. 


4. Place an edge between vertex v; and the vertex whose subscript appears 
first in the sequence o;. 


5. Remove the first number in the sequence o; to create a new sequence o;+1. 
Remove the element 7 from the set 5; to create a new set Sj+1. 


6. If the sequence o;,1 is empty, place an edge between the two vertices 
whose subscripts are in S;41, and stop. Otherwise, increment 7 by | and 
return to step 3. 
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FIGURE 1.47. Creating a Priifer sequence. 


Let us apply this algorithm to a particular example. Let o = 4,3,1,3,1 be 
our initial sequence to which we wish to assign a particular labeled tree. Since 
there are five terms in the sequence, our labels will come from the set S = 
{1,2,3,4,5,6, 7}. After drawing the seven vertices, we look in the set S = So 
to find the smallest subscript that does not appear in the sequence o = ao. Sub- 
script 2 is the one, and so we place an edge between vertices v2 and va, the first 
subscript in the sequence. We now remove the first term from the sequence and 
the label va from the set, forming a new sequence 0, = 3,1,3,1 and a new 
set S; = {1,3,4,5,6,7}. The remaining steps in the process are shown in Fig- 
ure 1.48. 
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FIGURE 1.48. Building a labeled tree. 


You will notice that the tree that was created from the sequence o in the second 
example is the very same tree that created the sequence o in the first example. 
Score one for Priifer! 


Matrix Tree Theorem 


The second major result that we present in this section is the Matrix Tree The- 
orem, and like Cayley’s Theorem, it provides a way of counting spanning trees 
of labeled graphs. While Cayley’s Theorem in essence gives us a count on the 
number of spanning trees of complete labeled graphs, the Matrix Tree Theorem 
applies to labeled graphs in general. The theorem was proved in 1847 by Kirch- 
hoff [175], and it demonstrates a wonderful connection between spanning trees 
and matrices. 
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The theorem involves two special matrices. One is the adjacency matrix (de- 
fined back in Section 1.2.2), and the other is defined as follows. Let G be a graph 
with vertices v1, V2,...Un. The degree matrix of G is the n x n matrix D whose 
(i,j) entry, denoted by [D];,;, is defined by 

_ f deg(u;) ift= 9, 
[Ding = { 0 otherwise. 
So, the diagonal entries of D are the vertex degrees, and the off-diagonal entries 
are all zero. 
Given ann x n matrix M, the 7,7 cofactor of M is defined to be 


(-1)'*7 det(M(ilj)), 


where det(/(i|j)) represents the determinant of the (n — 1) x (nm — 1) matrix 
formed by deleting row 7 and column 7 from M. 

We are now ready to state the Matrix Tree Theorem, due to Kirchhoff. The 
proof that we give imitates those presented in [148] and [52]. 


Theorem 1.19 (Matrix Tree Theorem). /f G is a connected labeled graph with 
adjacency matrix A and degree matrix D, then the number of unique spanning 
trees of G is equal to the value of any cofactor of the matrix D — A. 


Proof. Suppose G has n vertices (v1,...,Un) and k edges (f1,..., fx). Since G 


is connected, we know that k& is at least n — 1. Let N be the n x k matrix whose 
(i, 7) entry is defined by 


1 if vu; and f; are incident, 
[M]i.3 = { fi 


0 otherwise. 


N is called the incidence matrix of G. Since every edge of G is incident with 
exactly two vertices of G, each column of N contains two 1’s and n — 2 zeros. 
Let M be the n x k matrix that results from changing the topmost | in each 
column to —1. To prove the result, we first need to establish two facts, which we 
call Claim A and Claim B. 


Claim A. MMT = D — A (where M7 denotes the transpose of /). 
First, notice that the (i, 7) entry of D — A is 
deg(u;) ift =, 
[D-—A]i;= 4 —-1 ifi A j and vv; € E(G), 
0 ifi A j and yw; ¢ E(G). 


Now, what about the (i, ) entry of MM7? The rules of matrix multiplication tell 
us that this entry is the dot product of row i of M and column j of M7. That is, 


[MM*], = ([Mii, [M]i2,---,[M]in) - (M7 )1,5, [M7 ]23,---, [M7 e3) 
= ([M]ia, [M]i2,---,[Mix) - ([M]3,1, [M]3,2,--- [M]j,«) 


1.3 Trees 49 


If 2 = 7, then this sum counts one for every nonzero entry in row 2; that is, it 
counts the degree of v;. Ifi £ j and vjv; ¢ E(G), then there is no column of 
in which both the row 7 and row 7 entries are nonzero. Hence the value of the sum 
in this case is 0. If i A j and u,v; € E(G), then the only column in which both 
the row 2 and the row 7 entries are nonzero is the column that represents the edge 
v;vj;. Since one of these entries is | and the other is —1, the value of the sum is 
—1. We have shown that the (i, 7) entry of MM M7 is the same as the (i, j) entry 
of D — A, and thus Claim A is proved. 

Let H be a subgraph of G with n vertices and n — 1 edges. Let p be an arbitrary 
integer between 1 and n, and let MM’ be the (n — 1) x (nm — 1) submatrix of 
formed by all rows of M except row p and the columns that correspond to the 
edges in H. 


Claim B. If H is a tree, then | det(1/’)| = 1. Otherwise, det(M/’) = 0. 


First suppose that H is nota tree. Since H has n vertices and n—1 edges, we know 
from earlier work that H must be disconnected. Let H, be a connected component 
of H that does not contain the vertex v,. Let M” be the |V(H1)| x (n — 1) 
submatrix of /’ formed by eliminating all rows other than the ones corresponding 
to vertices of H,. Each column of 14” contains exactly two nonzero entries: 1 and 
—1. Therefore, the sum of all of the row vectors of IM” is the zero vector, so the 
rows of M/” are linearly dependent. Since these rows are also rows of M’, we see 
that det (/’) = 0. 

Now suppose that H is a tree. Choose some leaf of A that is not v, (Theo- 
rem 1.14 lets us know that we can do this), and call it u;. Let us also say that e; is 
the edge of #7 that is incident with wy. In the tree H — uj, choose ug to be some 
leaf other than v,. Let e2 be the edge of H — wu, incident with uz. Keep removing 
leaves in this fashion until vu, is the only vertex left. Having established the list of 
vertices U1, U2,...,Un—1, We now create a new (n — 1) x (n — 1) matrix M* by 
rearranging the rows of M’ in the following way: row i of M* will be the row of 
M’ that corresponds to the vertex uj. 

An important (i.e., useful!) property of the matrix 1/* is that it is lower tri- 
angular (we know this because for each 7, vertex u; is not incident with any of 
€i+1; €i+2;---;€n—1). Thus, the determinant of (/* is equal to the product of the 
main diagonal entries, which are either 1 or —1, since every wu, is incident with e;. 
Thus, | det(1/*)| = 1, and so | det(M’)| = 1. This proves Claim B. 

We are now ready to investigate the cofactors of D— A = MM7. It isa 
fact from matrix theory that if the row sums and column sums of a matrix are all 
0, then the cofactors all have the same value. (It would be a nice exercise—and a 
nice review of matrix skills—for you to try to prove this.) Since the matrix MMT 
satisfies this condition, we need to consider only one of its cofactors. We might 
as well choose 2 and 7 such that 7 + 7 is even—let us choose i = 1 and 7 = 1. So, 
the (1,1) cofactor of D — A is 


det ((D — A)(1|1)) = det (MM7(1/1)) 
= det(M,M7) 
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where I, is the matrix obtained by deleting the first row of D — A. 

At this point we make use of the Cauchy—Binet Formula, which says that the 
determinant above is equal to the sum of the determinants of (n — 1) x (n — 1) 
submatrices of (/, (for a more thorough discussion of the Cauchy—Binet Formula, 
see [40]). We have already seen (in Claim B) that any (n — 1) x (n— 1) submatrix 
that corresponds to a spanning tree of G will contribute | to the sum, while all 
others contribute 0. This tells us that the value of det(D — A) = det(MM7) is 
precisely the number of spanning trees of G. oO 


Figure 1.49 shows a labeled graph G and each of its eight spanning trees. 
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FIGURE 1.49. A labeled graph and its spanning trees. 


The degree matrix D and adjacency matrix A are 


2 0 0 0 0 0 1 1 
0 2 0 0 0 0 1 1 
P=) gf 30. |%* 8" la 2G 2] 
00 0 3 1 1 1 0 


and so 
2 QO -1l -1l 
0 2 -1 -l 
—-1 -1 3 -!1 
—-1 -1 -1 38 


D-A= 


The (1,1) cofactor of D — A is 


2 =L: =] 
det} -1 3 -1]=8. 
=] =L 3 


Score one for Kirchhoff! 
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Exercises 


1. 


1.4 


Let T be a labeled tree. Prove that the Priifer sequence of T’ will not contain 
any of the leaves’ labels. Also prove that each vertex v will appear in the 
sequence exactly deg(v) — 1 times. 


. Determine the Priifer sequence for the trees in Figure 1.50. 


6 7 8 9 10 II 


FIGURE 1.50. Two labeled trees. 


. Draw and label a tree whose Priifer sequence is 


5,4,3,5,4,3,5,4,3. 


. Which trees have constant Priifer sequences? 
. Which trees have Priifer sequences with distinct terms? 


. Let e be an edge of K,,. Use Cayley’s Theorem to prove that i, — e has 


(n — 2)n”~% spanning trees. 


. Use the Matrix Tree Theorem to prove Cayley’s Theorem. Hint: Look back 


at the discussion prior to the statement of the Matrix Tree Theorem. 


Trails, Circuits, Paths, and Cycles 


Takes a real salesman, I can tell you that. Anvils have a limited 
appeal, you know. 
— Charlie Cowell, anvil salesman, The Music Man 


Charlie Cowell was a door to door anvil salesman, and he dragged his heavy wares 
down every single street in each town he visited. Not surprisingly, Charlie became 
quite proficient at designing routes that did not repeat many streets. He certainly 
did not want to drag the anvils any farther than necessary, and he especially liked 
it when he could cover every street in the town without repeating a single one. 
After several years of unsuccessful sales (he saw more closed doors than closed 
deals), the Acme Anvil Company did the natural thing — they promoted him. 
Charlie moved from salesman to regional supplier. This meant that Charlie would 
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be in charge of driving the anvil truck from town to town, delivering each town’s 
supply of anvils. Still efficiency-minded, he wanted to plan driving routes that 
did not repeat any town along the way. He had been very good at route planning 
during his door to door days, and avoiding the repetition of towns was basically 
the same as avoiding the repetition of streets, right? 

As you read through this section, see if you can answer that question for your- 
self. 


1.4.1. The Bridges of K6nigsberg 


One should make a serious study of a pastime. 
— Alexander the Great 


At the very beginning of this chapter, we referred to the legendary K6nigsberg 
Bridge Problem. As you will recall, this problem concerned the existence (or non- 
existence) of a certain type of route across a group of bridges (Figure 1.1). Could 
one design a route that crossed each bridge exactly once? The residents of seven- 
teenth and eighteenth century K6nigsberg passed the time making valiant efforts, 
but no route could be found. 

In 1736, the Swiss mathematician Euler addressed the problem ([98], translated 
in [26]). Near the beginning of his article, Euler described his thoughts as he 
embarked on the search for a solution. 


As far as the problem of the seven bridges of K6nigsberg is con- 
cerned, it can be solved by making an exhaustive list of all possible 
routes, and then finding whether or not any route satisfies the con- 
ditions of the problem. Because of the number of possibilities, this 
method of solution would be too difficult and laborious, and in other 
problems with more bridges it would be impossible ... Hence I re- 
jected it, and looked for another method concerned only with the 
problem of whether or not the specified route could be found; I con- 
sidered that such a method would be much simpler. [26] 


This passage is enlightening on several levels. For one, it gives us a glimpse of 
the teacher/expositor side of the master mathematician Euler. It is doubtful that 
he would have seriously considered listing all possible routes in search of a satis- 
factory one. His mention of the possibility, though, does help the reader progress 
along a natural arc of thought regarding the solution. The passage also gives a clue 
as to what Euler is really after — not just a solution to the problem in K6nigsberg, 
but a general solution that could be applied in other land/bridge formations. 
Using a figure similar to Figure 1.51, he used sequences of letters to describe 
routes — routes where no bridges were repeated. For instance, the sequence 
ABDACAB represented a route that started at A, crossed a bridge to B, crossed 
a bridge to D, crossed a bridge back to A, crossed a bridge to C, crossed a bridge 
back to A, and then crossed a bridge to B. This seven letter sequence includes 
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FIGURE 1.51. Layout of the bridges in K6nigsberg. 


six of the seven bridges in K6nigsberg, and these bridges can be identified by the 
consecutive pairs in the sequence: AB, BD, DA, AC, CA, AB. 

For Euler the K6nigsberg Bridge Problem boiled down to finding a certain 
sequence of letters. He described it in this way: 


The problem is therefore reduced to finding a sequence of eight 
letters, formed from the four letters A, B, C’, and D, in which the 
various pairs of letters occur the required number of times. Before I 
turn to the problem of finding such a sequence, it would be useful to 
find out whether or not it is even possible to arrange the letters in this 
way, for if it were possible to show that there is no such arrangement, 
then any work directed towards finding it would be wasted. I have 
therefore tried to find a rule which will be useful in this case, and in 
others, for determining whether or not such an arrangement can exist. 
[26] 


Euler argued that since land area D is connected to three bridges, then D must ap- 
pear in the sequence two times. (If it appeared only once, this would not account 
for all of the bridges; if it appeared more than twice, this would represent multiple 
crossing of at least one bridge.) Similarly, Euler argued, B and C' must each ap- 
pear twice in the eight letter sequence. Further, since land area A is connected to 
five bridges, the letter A must appear three times in the sequence (you will verify 
this in Exercise la). This means that the necessary eight letter sequence would 
have three As, two Bs, two C's and two Ds. In Euler’s words, 


It follows that such a journey cannot be undertaken across the seven 
bridges of KGnigsberg. [26] 


Once he had settled the problem of the K6nigsberg bridges, Euler used the same 
ideas and methods to present a more general result, and we will see that result in 
the next section. As you see, Euler did not use terms like graph, vertex or edge. 
Today’s graph terminology did not appear until many years later. Still, this article 
by Euler was the seed from which the field of graph theory grew. 

Euler himself recognized that he was working in relatively uncharted territory. 
We close this section with the passage with which Euler opened his seminal arti- 
cle. See if you can read “graph theory” between the lines. 
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In addition to that branch of geometry which is concerned with 
magnitudes, and which has always received the greatest attention, 
there is another branch, previously almost unknown, which Leibniz 
first mentioned, calling it the geometry of position. This branch is 
concerned only with the determination of position and its properties; 
it does not involve measurements, nor calculations made with them. 
It has not yet been satisfactorily determined what kind of problems 
are relevant to this geometry of position, or what methods whould 
be used in solving them. Hence, when a problem was recently men- 
tioned [the K6nigsberg problem!], which seemed geometrical but 
was so constructed that it did not require the measurement of dis- 
tances, nor did calculation help at all, I had no doubt that it was 
concerned with the geometry of position — especially as its solu- 
tion involved only position, and no calculation was of any use. I have 
therefore decided to give here the method which I have found for 
solving this kind of problem, as an example of the geometry of posi- 
tion. [26] 


Mathematics is richer because Euler took up the problem of the K6nigsberg 
bridges. We are grateful, but perhaps not as grateful as the residents of KOnigsberg 


whose feet had grown tired from the search for a route that did not exist. 


Exercises 


1. In the context of Euler’s letter sequences, prove that ... 
(a) if a land mass L is connected to 5 bridges, then L will occur 3 times 
in any representation of a route that crosses all of the bridges once. 


(b) if aland mass L is connected to n bridges, where n is odd, then L will 
occur net times in any representation of a route that crosses all of the 


bridges once. 


2. An eighth bridge was built in K6nigsberg — an additional bridge joining 
land masses B and C. Did this addition make the desired route possible? 


Prove your answer. 


. Euler’s 1736 article included a second example of a land/bridge system (see 
Figure 1.52). Does a route exist that crosses each bridge exactly once? If 


so, give one. If not, prove it. 


. The streets in River City are shown in Figure 1.53. Is it possible for Charlie 
the anvil salesman to plan a route that covers every street exactly once? If 


so, give one. If not, prove it. 
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FIGURE 1.52. 
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FIGURE 1.53. 


1.4.2. Eulerian Trails and Circuits 


If only part of the line-system has been traversed, then every node in 
the remaining part remains even or odd, just as it was in the original 
system. 

— Carl Hierholzer [159], translated in [26] 


Recall from Section 1.1.2 that a trail in a graph is a walk that does not repeat any 
edges, and that a closed trail (one that begins and ends at the same vertex) is called 
a circuit. 

If a trail in a graph G includes every edge of G, then that trail is said to be an 
Eulerian trail. Similarly, an Eulerian circuit in a graph is a circuit that includes 
every edge of the graph. A graph that contains an Eulerian circuit is said to be an 
Eulerian graph. 

What are some examples of Eulerian graphs? The cycles, C;,, have prominent 
Eulerian circuits. The paths, P,,, have no circuits at all, and so they are certainly 
not Eulerian. Look at the graphs in Figure 1.54 and try to determine which ones, 
if any, are Eulerian. 

There are two well-known characterizations of Eulerian graphs. One involves 
vertex degrees, and the other concerns the existence of a special collection of 
cycles. The following theorem establishes both of these characterizations by as- 
serting the logical equivalence of three statements. The theorem represents work 
by Euler [98], Hierholzer [159], and Veblen [274]. 
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FIGURE 1.54. Are any of these Eulerian? 


Theorem 1.20. For a connected graph G, the following statements are equiva- 
lent. 


1. Gis Eulerian. 
2. Every vertex of G' has even degree. 
3. The edges of G can be partitioned into (edge-disjoint) cycles. 


Proof. To prove the logical equivalence of these statements, we prove that the first 
statement implies the second, the second implies the third, and the third implies 
the first. 

For the first implication, suppose that G contains an Eulerian circuit C’. Let v 
be an arbitrary vertex of G. Every time the circuit enters v on an edge, it must 
leave on a different edge. Since C' never repeats an edge, there must be an even 
number of edges incident with v and hence the degree of v is even. 

For the second implication, suppose that every vertex of G has even degree. 
We use induction on the number of cycles in G. Since G is connected and has 
no vertices of degree 1, G is not a tree, and therefore G must have at least one 
cycle. If G has exactly one cycle, the G must be a cycle graph C;, for some 
n, and so the desired partition contains just the one cycle itself. Suppose now 
(using strong induction) that the implication is valid for graphs containing at most 
k; cycles, and suppose G has k + 1 cycles. Let C’ be one of the cycles of G, 
and let G” be the graph obtained from G by deleting the edges of C. With this 
deletion, each vertex of C’ loses exactly two edges, and hence the vertices of G’ 
all have even degree. Further, the graph G’ (which is possibly disconnected) has 
connected components that have no more than k cycles each. Each component, 
then, satisfies the induction hypothesis and has edges that can be partitioned into 
cycles. These cycles, together with the cycle C’, partition the edges of G into 
cycles. The induction is complete, and the implication is established. 

For the third implication, suppose that the edges of G' can be partitioned into 
cycles. Call these cycles S;, So, ..., S,. Let C be the largest circuit in G such 
that the set of edges of C is exactly 


E(Sj,) U E(S;,) U-+-U E(S5,,) 


for some collection of the cycles S;,,.55,,..., 5;,,- (We note here that this implies 
that for each cycle S; (1 <7 < k), either all of the edges of 5; are on C or none 
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of them are.) Now, suppose e is an edge of G that is (a) not an edge of C, and (b) 
incident with a vertex, say v, that is on C. Since e is not an edge of C, it must be 
that e is an edge of cycle S;, for some 7, where no edge of S; is on C. The vertex 
v must also be on $;. Let C’ be the circuit in G obtained by patching S; into C' at 
the vertex v (since no edge of S; is a member of C, there is no repetition of edges 
caused by this patching). Since the edges of C’ consist of the edges of C' together 
with the edges of S;, we have contradicted the maximality of C’. This means that 
no such edge e can exist and therefore that C is an Eulerian circuit of G. The final 
implication is established. O 


So, Eulerian circuits exist in connected graphs precisely when the degrees of 
these graphs are all even. What about Eulerian trails? Certainly if an Eulerian 
circuit exists, then so does an Eulerian trail (the circuit is just a closed trail). But 
are there graphs which are not Eulerian but which do contain an Eulerian trail? 
The following corollary gives the complete answer. Its proof is left for you as an 
exercise. 


Corollary 1.21. The connected graph G contains an Eulerian trail if and only if 
there are at most two vertices of odd degree. 


Now that we know precisely when Eulerian circuits and trails exist, how easy 
is it to find them? The algorithm given below, named for nineteenth century math- 
ematician Carl Hierholzer [159], gives a simple way of identifying such routes. 
While not identical, you may notice a similarity between this algorithm and the 
method used to prove the third implication in the proof of Theorem 1.20. 

Before reading on, take a look back at the quotation given at the beginning 
of this section. It describes the primary reason for the success of Hierholzer’s 
algorithm. 

Hierholzer’s Algorithm for Identifying Eulerian Circuits 

Given: An Eulerian graph G. 

i. Identify a circuit in G' and call it R1. Mark the edges of Ry. Leti = 1. 


ii. If R; contains all edges of G, then stop (since R; is an Eulerian circuit). 


ili. If R; does not contain all edges of G, then let v; be a vertex on R; that is 
incident with an unmarked edge, e;. 


iv. Build a circuit, Q;, starting at vertex v; and using edge e;. Mark the edges 


of Q ie 
v. Create a new circuit, R;,1, by patching the circuit Q,; into R; at v;. 
vi. Increment 72 by 1, and go to step 11. 


An example of this process is shown in Figure 1.55. You should note that the 
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FIGURE 1.55. The stages of Hierholzer’s algorithm. 


process will succeed no matter what the initial circuit, R;, is chosen to be. Another 
algorithm for finding Eulerian circuits is given in Exercise 3. 

The even degree characterization of Eulerian graphs is really quite nice. All one 
needs to do to determine if a graph is Eulerian is simply look at the degrees of the 
vertices. Once we know a graph is Eulerian, Hierholzer’s algorithm will give us 
an Eulerian circuit. Maybe Charlie Cowell, our anvil salesman, used these ideas 
to plan his door to door routes! 


Exercises 


1. For each of the following, draw an Eulerian graph that satisfies the condi- 
tions, or prove that no such graph exists. 


(a) An even number of vertices, an even number of edges. 
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(b) An even number of vertices, an odd number of edges. 
(c) An odd number of vertices, an even number of edges. 
(d) An odd number of vertices, an odd number of edges. 


2. Use Hierholzer’s algorithm to find an Eulerian circuit in the graph of Fig- 
ure 1.56. Use R, : a,b,c, 9, f, 7,7, €, @ as your initial circuit. 


FIGURE 1.56. 


3. What follows is another algorithm (from [195]) for finding Eulerian cir- 
cuits. The method used here is to build the circuit, one edge at a time, mak- 
ing sure to make good choices along the way. 


Fleury’s Algorithm for Identifying Eulerian Circuits 


Given: An Eulerian graph G, with all of its edges unmarked. 


i. Choose a vertex v, and call it the “lead vertex.” 
ii. If all edges of G have been marked, then stop. Otherwise continue to 
step iii. 

iii. Among all edges incident with the lead vertex, choose, if possible, one 
that is not a bridge of the subgraph formed by the unmarked edges. If 
this is not possible, choose any edge incident with the lead. Mark this 
edge and let its other end vertex be the new lead vertex. 


iv. Go to step il. 


Use Fleury’s algorithm to find an Eulerian circuit for the graph in Fig- 
ure 1.57. Let a be your initial vertex. 


4. Prove Corollary 1.21. 


5. Prove that if every edge of a graph G lies on an odd number of cycles, then 
G is Eulerian. 


6. Let G be a connected graph which is regular of degree r. Prove that the line 
graph of G, L(G), is Eulerian. 
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FIGURE 1.57. 
7. Let G = Kny.no- 


(a) Find conditions on n, and nz that characterize when G will have an 
Eulerian trail. 


(b) Find conditions that characterize when G will be Eulerian. 


8. Let G = Kn,.....n,, where k > 3. 


(a) Find conditions on n1,...,7, that characterize when G will have an 
Eulerian trail. 


(b) Find conditions that characterize when G will be Eulerian. 


1.4.3. Hamiltonian Paths and Cycles 


In this new Game (... named Icosian, from a Greek work signifying 
‘twenty’) a player is to place the whole or part of a set of twenty 
numbered pieces or men upon the points or in the holes of a board 
...in such a manner as always to proceed along the lines of the 


figure... 
— from the instructions which accompanied The Icosian Game [26] 


In 1859 the English game company Jaques and Son bought the rights to manufac- 
ture and market “The Icosian Game.” The game involved twenty pegs (numbered 
1 through 20) and a gameboard with holes (see Figure 1.58). Players were re- 
quired to place the pegs in the holes in such a way that every pair of consecutive 
integers, along with the pair (1,20), were directly connected by one of the lines 
on the board. As intriguing and action-packed as the game sounds, it did not sell 
well.® 

Another version of the game had the board design situated on the exterior of 
a dodecahedron, a 12-sided solid. The object here was to find a round-trip route 
around the solid, traveling only on the edges and touching each vertex once. This 
version was named “A Voyage Round the World” since the vertices represented 
important cities of the time. Like its twin “The Icosian Game,” this game’s voyage 
was short lived. 


8Jaques and Son managed to get over this particular setback. The company, still in business today, 
had much better success in popularizing Tiddledy Winks (now Tiddly Winks), Snakes and Ladders 
(now Chutes and Ladders), and Whiff Whaff (now Table Tennis). 
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FIGURE 1.58. The Icosian Game board. 


The inventor who sold the game to Jaques and Son was the prominent math- 
ematician Sir William Rowan Hamilton. Even though his ideas did not take root 
in a recreational sense, they did become the seed for what would become a major 
branch of inquiry within the field of graph theory. Let’s take a look at some of 
these ideas. 

If a path P spans the vertices of G (that is, if V(P) = V(G)), then P is said to 
be a Hamiltonian path of G. Any graph containing a Hamiltonian path is called 
traceable. If a cycle C' spans the vertices of a graph G, such a cycle is called 
a Hamiltonian cycle, and any graph containing a Hamiltonian cycle is called, 
simply, a Hamiltonian graph. Hamiltonian graphs are clearly traceable, but the 
reverse is not always true. Look at the graphs in Figure 1.59 and try to determine 
which ones are traceable, Hamiltonian, or neither. 


G G, G 


FIGURE 1.59. Which ones are Hamiltonian? Which are traceable? 


We saw in the previous section that whether or not a connected graph was Eule- 
rian depended completely on degree parity. Unfortunately, this is not the case for 
Hamiltonicity. Hamiltonian graphs can have all even degrees (Cj), all odd de- 
grees (Ky), or a mixture (G in Figure 1.59). Similarly, non-Hamiltonian graphs 
can have varying degree parities: all even (G2 in Figure 1.59), all odd (5,7), or 
mixed (Po). 
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If degree parity does not have much to do with Hamiltonicity, then what does? 
Researchers have worked for decades on this question, and their efforts have 
produced many interesting results. A complete summary of these developments 
would require many pages,” and we do not attempt to give a thorough treatment 
here. Rather, we present several of the classic ideas and results. 

The first result that we examine is due to Dirac [77]. It does concern degrees 
in a graph — but their magnitude rather than their parity. Recall that 6(G) is the 
minimum degree of G. 


Theorem 1.22. Let G be a graph of order n > 3. If 0(G) > n/2, then G is 
Hamiltonian. 


Proof. Let G bea graph satisfying the given conditions, and suppose that G' is not 
Hamiltonian. Let P be a path in G with maximum length, and say the vertices of 
P, in order, are v1, V2, ..., Up. Because of the maximality of P, we know that all 
of the neighbors of v; and of v, are on P. And since 6(G) > n/2, each of v; and 
Up has at least n/2 neighbors on P. 

We now claim that there must exist some 7 (1 < j < p—1) such that v; € 
N(vp) and vj;41 € N(v1). Suppose for the moment that this was not the case. 
Then for every neighbor v; of v, on P (and there are at least n/2 of them), vj+41 
is not a neighbor of v;. This means that 


n n nn 
25 ee 
deg(v1) <p 5 <n 5 5, 


contradicting the fact that 6(G) > n/2. Therefore, such a 7 exists (see Fig- 


FIGURE 1.60. 


ure 1.60). 

Let C be the cycle v1, v2,..., Uj, Up, Up—1, +++, Vj+1, V1. We have assumed that 
G is not Hamiltonian, and so there must be at least one vertex of G that is not on 
P. Further, since 6(G) > n/2, we know that G is connected (see Exercise 16a 
in Section 1.1.2). Therefore there must be a vertex w not on P that is adjacent to 
a vertex, say v;, on P. But then the path in G that begins with w, travels to v;, 
and then travels around the cycle C is a longer path than our maximal path P — 
a contradiction. Our initial assumption must have been incorrect. Therefore G is 
Hamiltonian. O 


°T™ 2003 Ron Gould published an article [129] which summarized related results. His survey was 
45 pages long, and even that only covered developments that took place within the previous dozen 
years! 
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There are two interesting things to note here. First, the lower bound in this the- 
orem is best possible. To see this, consider the graph G = K;,-+41. This graph is 
not Hamiltonian (you will show this as part of Excercise 12b) and 6(G) is strictly 
between el —1and eel Second, the theorem does not provide a charac- 
terization of Hamiltonian graphs. That is, there are plenty of Hamiltonian graphs 
that have relatively small minimum (and even maximum) degree. The cycles C’, 
are obvious examples. 

Dirac’s theorem is a corollary to the following general result of Ore [217]. The 
proof of Ore’s theorem is similar to that of the above, and it is left for you as an 
exercise. 


Theorem 1.23. Let G be a graph of order n > 3. If deg(x) + deg(y) > n for all 
pairs of nonadjacent vertices x, y, then G is Hamiltonian. 


A set of vertices in a graph is said to be an independent set of vertices if they 
are pairwise nonadjacent. The independence number of a graph G, denoted by 
a(G), is defined to be the largest size of an independent set of vertices from G. 
As an example, consider the graphs in Figure 1.61. The only independent set of 
size 2 in G, is {c,d}, so a(G,) = 2. There are two independent sets of size 3 in 
G2: {a,c,e} and {b, d, f}, and none of size 4, so a(G2) = 3. 


. a b 
G, G, 
e b 
f c 
d c : d 
FIGURE 1.61. 


The next theorem, due to Chvatal and Erdés [54], relates Hamiltonicity to in- 
dependence number and connectivity. Before stating and proving this result, let 
us introduce some helpful notation. If x and y are vertices on a path P, let |x, y] p 
denote the portion of P that runs from x to y. Further, given a cycle C’ with its 
vertices labeled in a specific orientation (say, clockwise), let [x, y]¢+ denote the 
portion of C' that runs clockwise from x to y. Similarly, [a, y|¢— would denote 
the portion of C' that runs counter-clockwise from x to y. 


Theorem 1.24. Let G be a connected graph of order n > 3 with vertex connec- 
tivity K(G) and independence number a(G). If K(G) > a(G), then G is Hamil- 
tonian. 


Proof. If G is as described, then «(G) > 2 — for if K(G) = 1, then a(G) = 1 
and thus G is either Ky, or Ke, contradicting the fact that n > 3. 

Let C be a longest cycle in G. Suppose that C’ is not a Hamiltonian cycle, and 
let v be a vertex of G that is not on C’. Let H be the connected component of 
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G — V(C) that contains v. Let c1, co, ..., c, be the vertices of C' that are adjacent 
to some vertex of H, and suppose that these vertices are labeled in a clockwise 
direction around the cycle C’. For each 7 (1 <i <r), let h; be a vertex of H that 
is adjacent to c;, and let d; be the immediate (clockwise) successor of c; on C. 

We now observe several things. First, it must be that r > «(G). If the vertices 
Ci, CQ, ..., Cr were removed from G, then H would be disconnected from the 
rest of the graph. Since «(G) is the size of the smallest cut set, it follows that 
r > «(G). The observation in the first paragraph then implies that r > 2. 

Second, no two of the vertices in the set {c1,c2,...,¢,} are consecutive ver- 
tices on C. To see this, suppose that there is some 7 such that c; and c,+1 are 
consecutive vertices on C’. Let P be a path from h, to h;+1 in H, and consider the 
cycle formed by replacing the edge c;c;+1 on C with the path c;, [hi, higilp, ci41- 
This cycle is longer than our maximal cycle C, a contradiction. This observation 
implies that the sets {c1, co,...,c,} and {d1, do,...,d,-} are disjoint. 

Third, for each i (1 < 7 < r), d; is not adjacent to v. To see this, suppose 
djvu € E(G) for some i, and let Q be a path from h; to v in H. In this case, 
the cycle formed by replacing the edge c,d; on C with the path c;, [hi, v]g, di is 
longer than C, again a contradiction. 

Now, let S = {v,di,do,...,d,}. The first observation above implies that 
|S| > «(G)+1 > a(G). This means that some pair of vertices in S must be adja- 
cent. Our third observation implies that d; must be adjacent to d; for some i < j. 
If Ris a path from h; to h; in H, then the cycle c;, [hi, hy| Rr, [e;, dilo-, [dj, clot 
is a longer cycle than C (see Figure 1.62). Our assumption that C’ was not a Hamil- 


FIGURE 1.62. 


tonian cycle has led to a contradiction. Therefore G' is Hamiltonian. O 


As was the case for Dirac’s theorem, the inequality in this theorem is sharp. 
That is, graphs G where «(G) > a(G) — 1 are not necessarily Hamiltonian. 
The complete bipartite graphs K,,.; provide proof of this. The Petersen graph, 
shown in Figure 1.63, is another example.!° 


!0The Petersen graph is well known among graph theorists for its surprising connections to many 
areas of the field, and for its penchant for being a counterexample to many conjectures. You have seen 
this graph already — it is the graph you should have obtained as the complement of the line graph of 
Ks in Exercise 7b of Section 1.1.3. 
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FIGURE 1.63. The Petersen Graph. 


The next theorem belongs to a category of results that relate Hamiltonicity to 
forbidden subgraphs. Given graphs G and H, if G does not contain a copy of H as 
an induced subgraph, then we say that G is H-free. If S is a collection of graphs, 
and if G does not contain any of the graphs in S' as induced subgraphs, then we 
say that G' is S-free. 

In 1974, Goodman and Hedetniemi [127] noticed something regarding two of 
the graphs shown in Figure 1.64. 


N 


FIGURE 1.64. Three special graphs. 


Theorem 1.25. If G is a 2-connected, { K 1,3, Z:}-free graph, then G is Hamilto- 


nian. 


Proof. Suppose G' is 2-connected and {K1,3, Z;}-free, and let C' be a longest 
cycle in G (we know that G contains at least one cycle — see Exercise 14 in 
Section 1.1.2). If C is not a Hamiltonian cycle, then there must exist a vertex v, 
not on C, which is adjacent to a vertex, say w, on C’. Let a and b be the immediate 
predecessor and successor of w on C. 

A longer cycle would exist if either a or b were adjacent to v, and so it must 
be that both a and b are nonadjacent to v. Now, if a is not adjacent to b, then 
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the subgraph induced by the vertices {w, v, a, b} is Ky,3, and we know that G is 
K,3-free. So it must be that ab € E(G). But if this is the case, then the subgraph 
induced by {w,v,a,b} is Z, a contradiction. Therefore, it must be that C' is a 
Hamiltonian cycle. O 


The fact that this result requires 2-connectivity should not be surprising, for 
2-connectivity is required for all Hamiltonian graphs. As you will prove in Exer- 
cise 2, graphs G where «(G) = 1 cannot have a spanning cycle. 

Another classic forbidden subgraph theorem involves /‘; 3 and the third graph 
shown in Figure 1.64. This is a result of Duffus, Gould, and Jacobson [81]. 


Theorem 1.26. Let G be a { K1,3, N}-free graph. 
1. If G is connected, then G is traceable. 
2. If G is 2-connected, then G is Hamiltonian. 


It is interesting to note that the graph (13 is involved in both of these theorems. 
This graph, affectionately referred to as the “claw,” appears in many forbidden 
subgraph results. Claw-free graphs have received a great deal of attention in recent 
years, especially within the context of Hamiltonicity problems. The claw will bare 
itself again in the next section in the context of unsolved problems. 


Exercises 


1. Give a solution to The Icosian Game. 

2. Prove that if G is Hamiltonian, then G' is 2-connected. 

3. Prove Theorem 1.23. 

4. Give the connectivity and independence number of the Petersen graph. 


5. Prove or disprove: The independence number of a bipartite graph is equal 
to the cardinality of one of its partite sets. 


6. Prove that if G is of order n and is regular, then a(G) < n/2. 


7. Prove that each of the 18-vertex graphs in Figure 1.65 is 2-connected, claw- 
free and nontraceable.'! 


8. For any graph G, prove that the line graph L(G) is claw-free. 
9, Let G be a K3-free graph. Prove that its complement, G, is claw-free. 
10. Let G be a graph and let S be a nonempty subset of V(G). 
‘Tn [154] the authors show that 2-connected, claw-free graphs of order less than 18 are traceable. 


They also show that the graphs in Figure 1.65 are the only 2-connected, claw-free, nontraceable graphs 
with order 18 and size at most 24. 
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FIGURE 1.65. 


(a) Prove that if G is Hamiltonian, then G — S' has at most |.S'| connected 
components. 


(b) Prove that if G is traceable, then G — S has at most |S] + 1 connected 
components. 
11. Prove that if G is Eulerian, then L(G) is Hamiltonian. 
12. LetG = Kn, no. 


(a) Find conditions on n, and nz that characterize the traceability of G. 


(b) Find conditions that characterize the Hamiltonicity of G. 
13. Let n be a positive integer. 


(a) Prove that Ky 2n,3n 18 Hamiltonian. 


(b) Prove that Ky, 2n,3n+1 is not Hamiltonian. 


1.4.4 Three Open Problems 


Nothing can stop the claw! 
— Fletcher Reede, Liar, Liar 


We close our discussion of paths and cycles with several questions. These prob- 
lems were posed years ago, and they have received a great deal of attention. While 
there has been progress on each of them, the original questions remain unan- 
swered. 


Intersecting Detour Paths 


Given a graph G, the detour order of G, denoted 7(G), is the number of vertices 
in a longest path in G. If a path P in G has 7(G) vertices, then we call P a detour 
pathinG. 

In Exercise 8 of Section 1.1.2, you proved that if G is a connected graph and 
if P, and P, are detour paths, then the interesection V(P,) M V(P2) must be 
nonempty. As you (hopefully!) saw, this result is not terribly difficult to prove. 
Consider the following question. 
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Question A: If G is connected and P,, P2, and P3 are detour paths in 
G, then must the intersection V(P,) NV (P2) 1V(P3) be nonempty? 


As you see, the only difference between this question and the earlier exercise is 
that this one involves three paths rather than two. But this difference makes all the 
difference, because Question A remains unsolved! 

The origin of this question can be traced to a related question asked by Gallai 
[118] in 1966: Is it true that in every connected graph there is at least one vertex 
that lies on every detour path? In 1969, Walther [278] gave a negative answer to 
the question in the form of the graph in Figure 1.66. The detour order of this graph 


FIGURE 1.66. Walther’s example. 


is 21, and every vertex is missed by at least one of the detour paths. Within the 
next several years, Walther and Zamfirescu, working independently (see [141], 
[279], [290]), had produced a smaller example, the graph in Figure 1.67. This 


FIGURE 1.67. A smaller example, given by Walther and Zamfirescu. 


graph has 12 vertices, has detour order 10, and every vertex is missed by at least 
one of the detour paths. The graph in Figure 1.67 is the smallest known graph 
where the detour paths have an empty intersection. 

Consider the following more general version of Question A. 
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Question B: If G is connected and P,,...,P,, are distinct detour 
n 


paths in G,, then must the intersection () V(P;) be nonempty? 
i=1 


The graph of Figure 1.67 demonstrates that for mn = 12, the answer to Question B 
is no. 
In 1975, Schmitz [250] presented the graph in Figure 1.68. The detour order 


FIGURE 1.68. Example given by Schmitz. 


of this graph is 13. There are exactly seven detour paths, and every vertex of the 
graph is missed by at least one of these paths. This tells us that for n = 7, the 
answer to Question B is no. 

We have already mentioned that for n = 3, the answer to Question B is un- 
known. The same is true for n = 4,5 and 6. When asked, most researchers would 
probably lean toward believing the result to be true for n = 3, although no proof 
is known as of yet. For now, it is simply a conjecture. 


Conjecture 1. /f G is connected, then the intersection of any three distinct detour 
paths in G is nonempty. 


Matthews and Sumner’s Conjecture 


We met the claw, 1.3, in the previous section. We saw two results in which the 
claw, when forbidden with another graph, implied Hamiltonicity in 2-connected 
graphs. There are other such pairs. In [20], [39], and [130], the respective authors 
showed that the pairs {k1.3,W}, {Ki.3, Ps}, and {K13, Z2} (see Figure 1.69) 
all imply Hamiltonicity when forbidden in 2-connected graphs. Do you see a pat- 


W Z, 
FIGURE 1.69. Two additional forbidden subgraphs. 
tern here? The claw seems to be prominent in results like this. In 1997, Faudree 


and Gould [102] showed that this was no coincidence. The graph N appears in 
Figure 1.64. 
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Theorem 1.27. If being {R,S}-free (where R, S are connected and neither is 
P3) implies that 2-connected graphs are Hamiltonian, then one of R, S is the 
claw, and the other is an induced subgraph of Pe, Z2, W, or N. 


Is the claw powerful enough to imply Hamiltonicity when forbidden by itself? 
Well, as you will prove in the exercises, the graph P3 is the only connected graph 
that, when forbidden, implies Hamiltonicity in 2-connected graphs. But what if 
the level of connectivity is increased? 


Question C: If G is claw-free and k-connected (k > 3), must G' be 
Hamiltonian? 


12 


The graph in Figure 1.70 is 3-connected, claw-free and non-Hamiltonian, “ and 


so the answer to Question C for k = 3 is no. 


FIGURE 1.70. A lovely example. 


In 1984 Matthews and Sumner [199] made the following, still unresolved, con- 
jecture. 


Conjecture 2. [f G is 4-connected and claw-free, then G is Hamiltonian. 


There has been some progress with regard to this conjecture. Most notably, in 
1997 Ryjaéek [245] proved the following theorem. 


Theorem 1.28. [f G is 7-connected and claw-free, then G is Hamiltonian. 


At this time, Question C for k = 4,5 and 6 is still unanswered. 


This graph, demonstrated by Matthews ([198], see also [199]) in 1982, is the smallest such graph, 
and it is the line graph of the graph obtained from the Petersen graph (what else?) by replacing each 
of the five “spoke” edges with a P3. 
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The Path Partition Conjecture 


Recall that 7(G), the detour order of G,, is the number of vertices in a longest path 
of G. Recall also that given a subset S of V(G), the notation (S') represents the 
subgraph of G induced by S.. 

Given a graph G and positive integers a and b, if the vertices of G can be 
partitioned into two sets A and B in such a way that 7((A)) < aand7((B)) < 6, 
then we say that G has an (a, b)-partition. 

As an example, consider the graph G' in Figure 1.71. The partition (Ai, B,), 


A, ={abodh,i} Ay = {c,d h, i} 
B= teh gik U B= {abefgik tl 


FIGURE 1.71. Partitioning the vertex set. 


where A; = {a,b,c,d,h,i} and B, = f{e,f,g,j9,k,l}, is not a valid (4, 7)- 


partition. This is because every longest path in both (A;) and (B;) has 6 vertices. 
On the other hand, the partition (Az, Bz), where 


Ag = {c,d, h,t}, Be = 405 0;6; 7,059; k,l} 


is a valid (4, 7)-partition, since T((A2)) < 4 and T((B2)) < 7. 

If a graph G has an (a, b)-partition for every pair (a, b) of positive integers such 
that a + b = 7(G), then we say that G is T-partitionable. In order to show that 
the graph in Figure 1.71 is 7-partitionable (since its detour order is 11), we would 
also need to show that the graph had (1, 10)-, (2, 9)-, (3, 8)-, and (5, 6)-partitions. 
Another way to demonstrate this would be to prove the following, still unresolved, 
conjecture. 


The Path Partition Conjecture. Every graph is T-partitionable. 
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The Path Partition Conjecture was first mentioned by Lovasz and Mihok in 
1981, and it has received a great deal of attention since then. Much of the progress 
has been the establishment that certain families of graphs are 7T-partitionable. The 
following theorem gives a small sampling of these results. 


Theorem 1.29. The graph G is T-partitionable if any one of the following is true: 
I. A(G) < 3; (follows from [189]) 
2. A(G) => |V(G)| — 8; (follows from [38]) 
3. 7(G) < 13; [83] 
4. 7(G) > |V(G)| — 1. [38] 


So, if the max degree of a graph G is either very small or relatively large, or 
if the longest path in a graph is either rather short or relatively long, then G is 
T-partitionable. 

In 2007, Dunbar and Frick [84] proved the following interesting result. 


Theorem 1.30. [f G is claw-free, then G is T-partitionable. 


There is the claw again! In the same article, the authors prove that in order to 
prove that the Path Partition Conjecture is true, it is sufficient to prove that every 
2-connected graph is T-partitionable. 

With each new result, researchers add to the arsenal of weapons that can be used 
to attack the Path Partition Conjecture. So far, though, the conjecture is holding 
strong. 

With all of this era’s computing power, how can it be that the Path Partition 
Conjecture and the other conjectures in this section remain unsolved? Computers 
are now doing more things than ever, faster than ever, so why can’t we just get 
a computer cranking away at these problems? These are reasonable questions to 
ask. But the conjectures here are not really questions of computation. These are 
problems that will require a combination of insight, cleverness and patience. 


Exercises 


1. If the word “connected” were removed from Conjecture |, could you settle 
the resulting conjecture? 


2. Show that every vertex of the graph in Figure 1.67 is missed by at least one 
detour path in the graph. 


3. In the graph of Figure 1.68, find the seven distinct detour paths and show 
that they have an empty intersection. 


4. Show that if G is 2-connected and P3-free, then G is Hamiltonian. 


5. Show that if being H-free implies Hamiltonicity in 2-connected graphs 
(where H is connected), then H is P3. 
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6. Verify that the graph in Figure 1.70 is actually the line graph of the graph 
obtained from the Petersen graph by replacing each of the five “spoke” 
edges with a P3. 


7. Prove that the graph in Figure 1.67 is T-partitionable by listing all necessary 
partitions. 


8. Prove that all bipartite graphs are 7-partitionable. 
9. Prove that all traceable graphs are 7-partitionable. 
10. Prove that every graph G has a (1, 7(G) — 1) partition. 


11. Prove that if a graph is (1, 1)-partitionable, then it is (a, b)-partitionable for 
all positive integers a and b. 


12. Show that all graphs are (a, b)-partitionable when a < 3. 


13. EXTRA CREDIT: Settle any of the conjectures in this section. 


1.5 Planarity 


Three civil brawls, bred of an airy word 
By thee, old Capulet, and Montague, 
Have thrice disturb’d the quiet of our Streets ... 
— William Shakespeare, Romeo and Juliet 


The feud between the Montagues and the Capulets of Verona has been well doc- 
umented, discussed, and studied. A fact that is lesser known, though, is that long 
before Romeo and Juliet’s time, the feud actually involved a third family—the 
Hatfields.'? The families’ houses were fairly close together, and chance meetings 
on the street were common and quite disruptive. 

The townspeople of Verona became very annoyed at the feuding families. They 
devised a plan to create separate, nonintersecting routes from each of the houses to 
each of three popular places in town: the square, the tavern, and the amphitheater. 
They hoped that if each family had its own route to each of these places, then the 
fighting in the streets might stop. 

Figure 1.72 shows the original layout of the routes. Try to rearrange them so 
that no route crosses another route. We will come back to this shortly. 


'3The Hatfields eventually grew tired of feuding, and they left Verona in search of friendlier terri- 
tory. They found a nice spot in the mountains of West Virginia, right across the river from a really nice 
family named McCoy. 
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FIGURE 1.72. Original routes. 


1.5.1 Definitions and Examples 


Define, define, well-educated infant. 
— William Shakespeare, Love’s Labour’s Lost 


A graph G is said to be planar if it can be drawn in the plane in such a way that 
pairs of edges intersect only at vertices, if at all. If G has no such representation, 
G is called nonplanar. A drawing of a planar graph G in the plane in which edges 
intersect only at vertices is called a planar representation (or a planar embedding) 
of G. 

Figure 1.73 shows examples of planar graphs. Notice that one of the drawings 


FIGURE 1.73. Examples of planar graphs. 


is not a planar representation—try to visualize untangling it. 

Proving a graph to be planar is in some cases very simple—all that is required 
is to exhibit a planar representation of the graph. This is certainly quite easy to do 
with paths, cycles, and trees. What about complete graphs? K,, K2, and K3 are 
clearly planar; Figure 1.74 shows a planar representation of K’4. We will consider 
Ks shortly. 


FIGURE 1.74. A planar representation of Ky. 
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The Montague/Capulet/Hatfield problem essentially amounts to finding a pla- 
nar representation of 3 3. Unfortunately, the townspeople of Verona just had to 
learn to deal with the feuding families, for A3,3 is nonplanar, and we will see an 
explanation shortly. 

What is involved in showing that a graph G is nonplanar? In theory, one would 
have to show that every possible drawing of G is not a planar representation. Since 
considering every individual drawing is out of the question, we need some other 
tools. 

Given a planar representation of a graph G, a region is a maximal section of 
the plane in which any two points can be joined by a curve that does not intersect 
any part of G. 

Informally, if a cookie cutter has the shape of a planar representation of G, 
then the cookies are the regions (see Figure 1.75). The big region, R7, is called 
the exterior (or outer) region. 


R, 


NOV 


FIGURE 1.75. Six small cookies and one very large cookie. 


It is quite natural to think of the regions as being bounded by the edges. A 
single edge can come into contact with either one or two regions. In Figure 1.76, 
edge e; is only in contact with one region, 5, and edges e2 and e3 are only in 
contact with Sz. Each of the other edges in Figure 1.76 is in contact with two 
regions. Let us say that an edge e bounds a region R if e comes into contact with 


FIGURE 1.76. Edges e1, e2, and e3 touch one region only. 
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FR and with a region different from f. Define the bound degree of R, denoted by 
b( R), to be the number of edges that bound region R. For example, in Figure 1.75, 
Figure 1.76, b(S;) = 6(.S3) = 3 and b(.S2) = 6. Note that in this graph, the edges 
€1, €2, and e3 do not contribute to the bound degree of any region. 

Figure 1.77 displays six planar graphs along with the numbers of vertices, 
edges, and regions. Before continuing to the next section, study these numbers 
and try to find a pattern. You might also notice that two of drawings are actually 
the same graph. This brings up an important point: The number of regions in a 
planar representation of a graph does not depend on the representation itself! 


~.2 3 
ll 
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FIGURE 1.77. Is there a pattern? 


Exercises 
1. Find planar representations for each of the planar graphs in Figure 1.78. 


2. Give planar representations of the graph in Figure 1.79 such that each of 
the following is the exterior region. 


(a) Ry 
(b) Re 
(c) Rg 
(d) R4 
(e) Rs 
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FIGURE 1.78. 


FIGURE 1.79. 


3. Explain why embedding a graph in a plane is essentially the same as em- 
bedding a graph on a sphere. 


4. Write a nice proof of the fact that every tree is planar. 
5. Draw a planar graph in which every vertex has degree exactly 5. 


6. Suppose that e is a bridge of a planar graph G. Prove that e does not bound 
a region in any planar representation of G. 


7. In [101] and [277], Fary and Wagner proved independently that every pla- 
nar graph has a planar representation in which every edge is a straight line 
segment. Find such a representation for the graph in Figure 1.80. 


FIGURE 1.80. 


8. If planar graphs G; and G»2 each have n vertices, g edges, and r regions, 
must the graphs be isomorphic? Justify your answer with a proof or a coun- 
terexample. 
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1.5.2 Euler’s Formula and Beyond 


Now I will have less distraction. 
— Leonard Euler, upon losing sight in his right eye [100] 


Euler discovered a relationship between the numbers of vertices, edges, and re- 
gions of a graph, and his discovery is often called Euler’s Formula [99]. 


Theorem 1.31 (Euler’s Formula). Jf G is a connected planar graph with n ver- 
tices, q edges, and r regions, then 


n-qtr=2. 


Proof. We induct on q, the number of edges. If g = 0, then G must be Kj, a 
graph with | vertex and | region. The result holds in this case. Assume that the 
result is true for all connected planar graphs with fewer than g edges, and assume 
that G has q edges. 

Case 1. Suppose G is a tree. We know from our work with trees that ¢g = n — 1; 
and of course, r = 1, since a planar representation of a tree has only one region. 
Thusn —q+r=n—(n—1)+1 = 2, and the result holds. 

Case 2. Suppose G is not a tree. Let C' be a cycle in G, let e be an edge of C, 
and consider the graph G — e. Compared to G, this graph has the same number 
of vertices, one edge fewer, and one region fewer, since removing e coalesces two 
regions in G' into one in G — e. Thus the induction hypothesis applies, and in 
G—e, 


n—(q-1)+(r—1) =2, 


implying thatn —q+r=2. 
The result holds in both cases, and the induction is complete. oO 


Euler’s Formula is useful for establishing that a graph is nonplanar. 


Theorem 1.32. [3 3 is nonplanar. 


Proof. Suppose that ‘3.3 were planar and that we had a planar representation. 
Since n = 6 and gq = 9, Euler’s Formula implies that such a planar representation 
of 3.3 would have r = 5 regions. Now consider the sum 


c= 5 WR), 
R 


where the sum is over all regions FR in the representation of the graph. Since every 
edge of G'can be on the boundary of at most two regions, we get C < 2q = 18.On 
the other hand, since each region of K3 3 has at least four edges on the boundary 
(there are no triangles in bipartite graphs), we see that C > 4r = 20. We have 
reached a contradiction. Therefore, /¢3 3 is nonplanar. O 


Theorem 1.33. [f G is a planar graph with n > 3 vertices and q edges, then 
q < 3n—6. Furthermore, if equality holds, then every region is bounded by three 
edges. 
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Proof. Again consider the sum 
C= S°d(R). 
R 


As previously mentioned, C' < 2q. Further, since each region is bounded by at 
least 3 edges, we have that C’ > 3r. Thus 


Br<2qg => 3(24+q-—n)<2¢ > q<3n-6. 


If equality holds, then 3r = 2q, and it must be that every region is bounded by 
three edges. O 


We can use Theorem 1.33 to establish that As is nonplanar. 


Theorem 1.34. Ks is nonplanar. 


Proof. Ks has 5 vertices and 10 edges. Thus 3n — 6 = 9 < 10 = gq, implying 
that Ks is nonplanar. O 


Exercise 5 in Section 1.5.1 asked for a planar graph in which every vertex has 
degree exactly 5. This next result says that such a graph is an extreme example. 


Theorem 1.35. /fG is a planar graph, then G contains a vertex of degree at most 
five. That is, 0(G) < 5. 


Proof. Suppose G has n vertices and q edges. If n < 6, then the result is imme- 
diate, so we will suppose that n > 6. If we let D be the sum of the degrees of the 
vertices of G’, then we have 


D = 2q < 2(3n — 6) = 6n — 12. 


If each vertex had degree 6 or more, then we would have D > 6n, which 
is impossible. Thus there must be some vertex with degree less than or 
equal to 5. O 


Exercises 


1. G is a connected planar graph of order 24, and it is regular of degree 3. 
How many regions are in a planar representation of G? 


2. Let G be a connected planar graph of order less than 12. Prove 6(G) < 4. 
3. Prove that Euler’s formula fails for disconnected graphs. 


4. Let G be a connected, planar, ’3-free graph of order n > 3. Prove that G 
has no more than 2n — 4 edges. 


5. Prove that there is no bipartite planar graph with minimum degree at 
least 4. 
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6. Let G be a planar graph with & components. Prove that 
n—-qtr=1+k. 
7. Let G be of order n > 11. Show that at least one of G and G is nonplanar. 


8. Show that the average degree (see Exercise 11 in Section 1.3.2) of a planar 
graph is less than six. 


9. Prove that the converse of Theorem 1.33 is not true. 
10. Find a 4-regular planar graph, and prove that it is unique. 


11. A planar graph G is called maximal planar if the addition of any edge to G 
creates a nonplanar graph. 


(a) Show that every region of a maximal planar graph is a triangle. 


(b) If a maximal planar graph has order n, how many edges and regions 
does it have? 


1.5.3. Regular Polyhedra 


We are usually convinced more easily by reasons we have found 
ourselves than by those which have occurred to others. 
— Blaise Pascal, Pensées 


A polyhedron is a solid that is bounded by flat surfaces. Dice, bricks, pyramids, 
and the famous dome at Epcot Center in Florida are all examples of polyhedra. 
Polyhedra can be associated with graphs in a very natural way. Think of the poly- 
hedra as having faces, edges, and corners (or vertices). The vertices and edges of 
the solid make up its skeleton, and the skeleton can be viewed as a graph. An in- 
teresting property of these skeleton graphs is that they are planar. One way to see 
this is to imagine taking hold of one of the faces and stretching it so that its edges 
form the boundary of the exterior region of the graph. The regions of these pla- 
nar representations directly correspond to the faces of the polyhedra. Figure 1.81 
shows a brick-shaped polyhedron, its associated graph, and a planar representa- 
tion of the graph. 


FIGURE 1.81. A polyhedron and its graph. 


Because of the natural correspondence, we are able to apply some of what we 
know about planar graphs to polyhedra. The next theorem follows directly from 
Euler’s Formula for planar graphs, Theorem 1.31. 
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Theorem 1.36. [fa polyhedron has V vertices, E edges, and F faces, then 
V-E+F=2. 


This next theorem is similar to Theorem 1.35. 
Given a polyhedron P, define p(P) to be 


p(P) = min{b(R) | R is aregion of P}. 
Theorem 1.37. For all polyhedra P, 3 < p(P) <5. 


Proof. Since one or two edges can never form a boundary, we know that p(P) > 
3 for all polyhedra P. So we need to prove only the upper bound. 

Let P be a polyhedron and let G be its associated graph. Suppose P has V 
vertices, FE’ edges, and F’ faces. For each k, let V;, be the number of vertices of 
degree k, and let Fj, be the number of faces of P (or regions of G) of bound degree 
k. From our earlier remarks, if k < 3, then Vi = Fy, = 0. Since every edge of P 
touches exactly two vertices and exactly two faces, we find that 


SO kV, = 2E = So kFy. 
k>3 k>3 
If every face of P were bounded by 6 or more edges, then we would have 
2B = SM > 6h = 6 Fi =6P, 
k>3 k>6 k>6 


implying that & > 3’. Furthermore, 


of =) hve > BV, 
k>3 


implying that V < ZEB. Thus 


a 
E=V+F-2<7E+,E-2=E-2, 


and this, of course, is a contradiction. Therefore, some face of P is bounded by 
fewer than 6 edges. Hence, p(P) <5. O 


We now apply this result to derive a geometric fact known to the ancient Greeks. 

A regular polygon is one that is equilateral and equiangular. We say a poly- 
hedron is regular if its faces are mutually congruent, regular polygons and if the 
number of faces meeting at a vertex is the same for every vertex. The cube, whose 
faces are congruent squares, and the tetrahedron, whose faces are congruent equi- 
lateral triangles, are regular polyhedra. A fact that has been known for at least 
2000 years is that there are only five regular polyhedra: the tetrahedron, the cube, 
the octahedron, the dodecahedron, and the icosahedron (see Figure 1.82). We can 
use a graph-theoretic argument to prove this. 
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Tetrahedron JN 
Cube 


Dodecahedron eA 


Icosahedron 


FIGURE 1.82. The five regular polyhedra and their graphical representations. 


Theorem 1.38. There are exactly five regular polyhedra. 


Proof. Let P be a regular polyhedron, and let G' be its associated planar graph. 
Let V, &, and F’ be the number of vertices, edges, and faces (regions) of P. Since 
the faces of P are congruent, each is bordered by the same number of edges, say 
k. Theorem 1.37 tells us that 3 < k& < 5. Further, since the polyhedron P is 
regular, it follows that the graph G is also regular. Let us say that G' is regular of 
degree r where r > 3. From Theorem 1.37, we obtain rV = 2E = kF. Now, 
Theorem 1.36 implies that 


8=4V —4E+4+4F 
=4V —-2E+4+4F—-2E 
=4V —-rV+4F—-kF 
=(4-r)V+(4-k)F. 
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V and E are of course both positive, and since 3 < k < 5 andr > 3, there are 
only five possible cases. 

Case 1. Suppose r = 3 and k& = 3. In this case, V = F and8 = V4 F, 
implying that V = F' = 4. This is the tetrahedron. (The fact that the tetrahedron 
is the only regular polygon with V = F' = 4 is based on a geometrical argument. 
This applies to the remaining four cases as well.) 

Case 2. Suppose r = 3 and k = 4. Here we have V = 8 and 3V = 4F. Thus 
F = 6, and P is acube. 

Case 3. Suppose r = 3 and k = 5. In this case we have 8 = V — F' and 3V = 
5F’. Solving this system yields V = 20 and F’ = 12. This is a dodecahedron. 

Case 4. Suppose r = 4 and k = 3. Here we have F' = 8 and 4V = 3F’. Thus 
V = 6 and P is an octahedron. 

Case 5. Suppose r = 5 and k = 3. In this case we have 8 = —V + F'and5V = 
3F’. Solving this system yields V = 12 and F' = 20. This is an icosahedron. O 


Exercises 


1. (From [52].) Show that the octahedron is a complete multipartite graph: 
Ky, ,....r, for some n and for some values 71, ..., Tn. 


2. Find an example of a polyhedron different from the ones discussed in this 
section. Sketch the polyhedron, and draw the associated graph. 


3. See if you can find an alternative proof (not necessarily graph-theoretic) of 
the fact that there are only five regular polyhedra. 


1.5.4 Kuratowski’s Theorem 


...@ pair so famous. 
— William Shakespeare, Anthony and Cleopatra 


Our goal in this section is to compile a list of all nonplanar graphs. Since the 
list will be infinite (and since this book is not), we will make use of a clever 
characterization due to Kuratowski. 

We have already established that both A’3 3 and A are nonplanar, so we should 
put them at the top of our list. What other graphs should we include? Suppose 
G is a graph that contains K’33 as a subgraph. This graph G' would have to be 
nonplanar, since a planar representation of it would have to contain a planar rep- 
resentation of /¢3.3. So we can add to our list of nonplanar graphs all graphs that 
contain /¢3 3 or 5 as a subgraph. 

The graph in Figure 1.83 shows us that our list of nonplanar graphs is not 
yet complete. This graph is not planar, but it does not contain Ks or 33 as a 
subgraph. Of course, if we were to replace the two edges labeled a and b with a 
single edge e, then the graph would contain (5 as a subgraph. This motivates the 
following definition. 
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FIGURE 1.83. 


Let G be a graph. A subdivision of an edge e in G is a substitution of a path for 
e. We say that a graph H is a subdivision of G if H can be obtained from G by a 
finite sequence of subdivisions. 

For example, the graph in Figure 1.83 contains a subdivision of Ks, and in 
Figure 1.84, H is a subdivision of G. 


G jel 


FIGURE 1.84. A graph and a subdivision. 


We leave the proof of the following theorem to the exercises (see Exercise 1). 
Theorem 1.39. A graph G is planar if and only if every subdivision of G is planar. 


Our list of nonplanar graphs now includes (3.3, Ks, graphs containing [’3.3 or 
Ks as subgraphs, and all graphs containing a subdivision of K3.3 or K's. The list 
so far stems from only two specific graphs: Ks 3 and Ks. A well-known theorem 
by Kuratowski [185] tells us that there are no other graphs on the list! The bottom 
line is that K3 3 and Ks are the only two real enemies of planarity. 

Kuratowski proved this beautiful theorem in 1930, closing a long-open prob- 
lem.'* In 1954, Dirac and Schuster [78] found a proof that was slightly shorter 
than the original proof, and theirs is the proof that we will outline here. 


Theorem 1.40 (Kuratowski’s Theorem). A graph G is planar if and only if it 
contains no subdivision of 3,3 or Ks. 


Sketch of Proof 

We have already discussed that if a graph G is planar, it contains no subgraph 
that is a subdivision of K3.3 or Ks. Thus we need to discuss only the reverse 
direction of the theorem. 


'4We should note here that Frink and Smith also discovered a proof of this fact in 1930, indepen- 
dently of Kuratowski. Since Kuratowski’s result was published first, his name has traditionally been 
associated with the theorem (and the names Frink and Smith have traditionally been associated with 
footnotes like this one.) 
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Suppose G is a graph that contains no subdivision of K’3,3 or A’5. Here are the 
steps that Dirac and Schuster used to prove the result. 


1. Prove that G' is planar if and only if each block of G is planar. (A block of 
G is a maximal connected subgraph of G' that has no cut vertex). 

2. Explain why it suffices to show that a block is planar if and only if it con- 
tains no subdivision of 13,3 or Ks. Assume that G is a block itself (con- 
nected with no cut vertex). 

3. Suppose that G is a nonplanar block that contains no subdivision of [3 3 
or Ks (and search for a contradiction). 

4. Prove that 6(G) > 3. 

5. Establish the existence of an edge e = wv such that the graph G — e is also 
a block. 

6. Explain why G — e is a planar graph containing a cycle C’ that includes both 
u and v, and choose C' to have a maximum number of interior regions. 

7. Establish several structural facts about the subgraphs inside and outside the 
cycle C. 

8. Use these structural facts to demonstrate the existence of subdivisions of 
3,3 or Ks, thus establishing the contradiction. 

Exercises 

1. Prove that a graph G is planar if and only if every subdivision of G is planar. 

2. Use Kuratowski’s Theorem to prove that the Petersen graph (Figure 1.63) 
is nonplanar. 

3. Prove the first step of the proof of Kuratowski’s Theorem. 

4. Determine all complete multipartite graphs (of the form K;,.,,___,,,) that are 
planar. 

1.6 Colorings 
One fish, two fish, red fish, blue fish. 
— Dr. Seuss 


The senators in a particular state sit on various senate committees, and the com- 
mittees need to schedule times for meetings. Since each senator must be present 
at each of his or her committee meetings, the meeting times need to be sched- 
uled carefully. One could certainly assign a unique meeting time to each of the 
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committees, but this plan may not be feasible, especially if the number of commit- 
tees is large. We ask ourselves, given a particular committee structure, what is the 
fewest number of meeting times that are required? We can answer this question 
by studying graph coloring. 


1.6.1 Definitions 


Given a graph G' and a positive integer k, a k-coloring is a function K : V(G) > 
{1,...,} from the vertex set into the set of positive integers less than or equal 
to k. If we think of the latter set as a set of k “colors,” then K is an assignment of 
one color to each vertex. 

We say that K is a proper k-coloring of G if for every pair u,v of adjacent 
vertices, K(u) # K(v) — that is, if adjacent vertices are colored differently. If 
such a coloring exists for a graph G,, we say that G' is k-colorable. 

For example, the graph Cs as shown in Figure 1.85 is 3-colorable: P(a) = 
P(c) = 1, P(b) = P(d) = 2, P(c) = 3. Since Cs is 3-colorable, a direct 


a 


d Cc 


FIGURE 1.85. C's is 3-colorable. 


consequence of the definition is that Cs is k-colorable for all k > 3. Is C5 2- 
colorable? 

Another way of viewing a proper k-coloring is as an assignment of vertices 
to sets, called color classes, where each set represents vertices that all receive the 
same color. For the coloring to be proper, each color class must be an independent 
set of vertices. 

It is natural to wonder how many colors are necessary to color a particular graph 
G. For instance, we know that three colors are enough for the graph in Figure 1.85, 
but is this the least required? A quick check of C's; reveals that coloring with 
two colors is impossible. So three colors are necessary. This idea motivates a 
definition. 

Given a graph G, the chromatic number of G’, denoted by .(G), is the smallest 
integer k such that G is k-colorable. In our example, we can say that y(Cs) = 3. 
What about odd cycles in general? (Try one!) What about even cycles? (Try one!) 
Here is a list of chromatic numbers for some common graphs. Verify them! 


2 if nis even, 
(Cn) = { 3. if nis odd, 
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2 ifn>2, 
(Pa) = { 1 ifn=1, 
x(n) =n, 
X(En) = ) 
x(Kom in) = 


Exercises 


1. Find the chromatic number of each of the following graphs. Explain your 


answers completely. 


(a) Trees. 

(b) Bipartite graphs. 

(c) Complete multipartite graphs, K;,, 7. )....7r;+ 

(d) The Petersen Graph. 

(e) The graph in Figure 1.86, called the Birkhoff Diamond. 
(f) The graphs of the regular polyhedra in Figure 1.82. 


FIGURE 1.86. The Birkhoff Diamond. 


2. Senate committees C, through C7 consist of the members as indicated: 


C, = {Adams, Bradford, Charles}, C2 = {Charles, Davis, Eggers}, 

C3 = {Davis, Ford}, Cy = {Adams, Gardner}, Cs = {Eggers, Howe}, 
Co = {Eggers, Bradford, Gardner}, C7 = {Howe, Charles, Ford}. 

Use the ideas of this section to determine the fewest number of meeting 
times that need to be scheduled for these committees. 


. When issuing seating assignments for his third grade students, the teacher 
wants to be sure that if two students might interfere with one another, then 
they are assigned to different areas of the room. There are six main trou- 
blemakers in the class: John, Jeff, Mike, Moe, Larry, and Curly. How many 
different areas are required in the room if John interferes with Moe and 


88 1. Graph Theory 


Curly; Jeff interferes with Larry and Curly; Mike interferes with Larry and 
Curly; Moe interferes with John, Larry, and Curly; Larry interferes with 
Jeff, Mike, Moe, and Curly; and Curly interferes with everyone? 


4. Prove that adding an edge to a graph increases its chromatic number by at 
most one. 


5. Prove that a graph G' of order at least two is bipartite if and only if it is 
2-colorable. 


6. A graph G is called k-critical if y(G) = k and y(G — v) < k for each 
vertex v of G. 


(a) Find all 1-critical and 2-critical graphs. 

(b) Give an example of a 3-critical graph. 

(c) If G is k-critical, then show that G is connected. 
(d) If Gis k-critical, then show that 6(G) > k—1. 
(e) Find all of the 3-critical graphs. Hint: Use part (d). 


1.6.2. Bounds on Chromatic Number 


The point is, ladies and gentlemen, that greed, for lack of a better 
word, is good. Greed is right. Greed works. 
— Gordon Gekko, in Wall Street 


In general, determining the chromatic number of a graph is hard. While small or 
well-known graphs (like the ones in the previous exercises) may be fairly easy, 
the number of possibilities in large graphs makes computing chromatic numbers 
difficult. We therefore often rely on bounds to give some sort of idea of what the 
chromatic number of a graph is, and in this section we consider some of these 
bounds. 

If G is a graph on n vertices, then an obvious upper bound on \(G) is n, since 
an n-coloring is always possible on a graph with n vertices. This bound is exact 
for complete graphs, as it takes as many colors as there are vertices to color a 
complete graph. In fact, complete graphs are the only graphs for which this bound 
is sharp (see Exercise 5). We set this aside as Theorem 1.41. 


Theorem 1.41. For any graph G of order n, yx(G) <n. 


Let us now discuss a very basic graph coloring algorithm, the greedy algorithm. 
To color a graph having n vertices using this algorithm, first label the vertices in 
some order—call them v1, v2, ..., Un. Next, order the available colors in some 
way. We will denote them by the positive integers 1, 2, ..., m. Then start coloring 
by assigning color | to vertex v,. Next, if vy and v2 are adjacent, assign color 
2 to vertex v2; otherwise, use color | again. In general, to color vertex v;, use 
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the first available color that has not been used for any of v;’s previously colored 
neighbors. 

For example, the greedy algorithm produces the coloring on the right from the 
graph on the left in Figure 1.87. First, v; is assigned color 1; then v2 is assigned 
color 1, since v2 is not adjacent to v;. Then v3 is assigned color | since it is not 
adjacent to v1 or vg. Vertex v4 is assigned color 2, then vs is assigned 2, and 
finally ve is assigned 2. 


vy V4 2) 1 2 1 

ra ea 

Vs V3 V6 2 1 2 
FIGURE 1.87. Applying the greedy algorithm. 


It is important to realize that the coloring obtained by the greedy algorithm 
depends heavily on the initial labeling of the vertices. Different labelings can (and 
often do) produce different colorings. Figure 1.88 displays the coloring obtained 
from a different original labeling of the same graph. More colors are used in this 


ra ra 
V6 V4 Vv 1 3 1 
FIGURE 1.88. Applying it again. 


second coloring. We see that while “greed works” in that the algorithm always 
gives a legal coloring, we cannot expect it to give us a coloring that uses the 
fewest possible colors. 

The following bound improves Theorem 1.41. 


Theorem 1.42. For any graph G, x(G) < A(G) + 1, where A(G) is the maxi- 
mum degree of G. 


Proof. Running the greedy algorithm on G produces a legal coloring that uses 
at most A(G) + 1 colors. This is because every vertex in the graph is adjacent 
to at most A(G) other vertices, and hence the largest color label used is at most 
A(G) + 1. Thus, x(G) < A(G) +1. O 


Notice that we obtain equality in this bound for complete graphs and for cycles 
with an odd number of vertices. As it turns out, these are the only families of 
graphs for which the equality in Theorem 1.42 holds. This is stated in Brooks’s 
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Theorem [41]. The proof that we give is a modification of the one given by Lovasz 
[190]. 


Theorem 1.43 (Brooks’s Theorem). /f G' is a connected graph that is neither an 
odd cycle nor a complete graph, then x(G') < A(G). 


Proof. Let G be a connected graph of order 7 that is neither a complete graph nor 
an odd cycle. Let k = A(G). We know that k 4 0 and k ¥ 1, since otherwise 
G is complete. If k = 2, then G must be either an even cycle or a path. In either 
case, y(G) = 2 = A(G). So assume that k = A(G) > 3. 

We are now faced with three cases. In each case we will establish a labeling of 
the vertices of G in the form v1, v2, ..., Un. We will then use the greedy algorithm 
to color G with no more than & colors. 

Case 1. Suppose that G' is not k-regular. Then there exists some vertex with 
degree less than k. Choose such a vertex and call it v,. Let So = {v,} and let 
S, = N(v,), the neighborhood of v,,. Further, let 


So = N(S1) = {un} = Si, 
53 = N(S2) — $1 — So, 


S; = N(S,-1) — Si-2 — Si-1, 


for each 7 (Figure 1.89). Since G is finite, there is some ¢ such that S; is not empty, 


FIGURE 1.89. The sets Sj. 


and S,. is empty for all r > t. 

Next, label the vertices in S with the labels v;_1, Un—o,..., Un—|S,|- Label the 
vertices in Sz with the labels v,_)¢,)-1, ---» Un—|$1|—|S2|- Continue labeling in 
this decreasing fashion until all vertices of G have been labeled. The vertex with 
label v, is in the set S}. 

Let u be a vertex in some S;, 7 > 1. Since wu has at least one neighbor in S;_1, 
it has at most k — 1 adjacencies with vertices whose label is less than its own. 
Thus, when the greedy algorithm gets to wu, there will be at least one color from 
{1,2,...,k} available. Further, since deg(u,) < k, there will be a color from 
{1,2,...,k} available when the greedy algorithm reaches v,,. Thus, in this case 
the greedy algorithm uses at most & colors to properly color G. 
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Case 2. Suppose that G' is k-regular and that G has a cut vertex, say v. The 
removal of v from G will form at least two connected components. Say the com- 
ponents are G'}, Go, ..., G;. Consider the graph H, = G, U {v} (the component 
G with v added back—see Figure 1.90). H; is a connected graph, and the degree 


FIGURE 1.90. The graph Hy. 


of v in # is less than k. Using the method in Case 1, we can properly color Hy 
with at most & colors. Similarly, we can properly color each H; = G'; — {v} with 
at most k colors. Without loss of generality, we can assume that v gets the same 
color in all of these colorings (if not, just permute the colors to make it so). These 
colorings together create a proper coloring of G that uses at most & colors. Case 
2 is complete. 

Case 3. Suppose that G is k-regular and that it does not contain a cut vertex. 
This means that G is 2-connected. 

Subcase 3a. Suppose that G' is 3-connected. This means that for all v, the graph 
G — v is 2-connected. Let v be a vertex of G with neighbors v; and v2 such that 
v1v2 ¢ E(G) (such vertices exist since G is not complete). By the assumption in 
this subcase, the graph G — {v1, v2} is connected. 

Subcase 3b. Suppose that G' is not 3-connected. This means that there exists 
a pair of vertices v, w such that the graph G — {v, w} is disconnected. Let the 
components of G — {v, w} be Gi, Go, ..., Gy. Since k > 3, it must be that each 
G; has at least two vertices. It also must be that v is adjacent to at least one vertex 
in each G‘;, since w is not a cut vertex of G. Let u € V(G1) be a neighbor of v. 
Suppose for the moment that u is a cut vertex of the graph G — v. If this is the 
case, then there must be another vertex y of G such that (i) y is not a cut vertex 
of the graph G' — v, and (ii) the only paths from y to w in G'— v go through vertex 
u. Since u is not a cut vertex of G itself, it must be that y is adjacent to v. In either 
case, it must be that v has a neighbor in G; (either u or y) that is not a cut vertex 
of G — v. The vertex v has a similar such neighbor in G'2. For convenience, let us 
rename: For i = 1, 2, let v; € V(G;) be a neighbor of v that is not a cut vertex 
of the graph G' — v. Vertices v; and vg are nonadjacent, and since they were in 
different components of G — {v, w}, it must be that G — {v 1, v2} is connected. 

In each subcase, we have identified vertices v, v;, and vg such that vv 1, vve € 
E(G), vive € E(G), and G — {v1, v2} is connected. We now proceed to label 
the vertices of G in preparation for the greedy algorithm. 
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Let v; and v2 be as labeled. Let v be labeled v,,. Now choose a vertex adja- 
cent to uv, that is not v; or v2 (such a vertex exists, since deg(v,,) > 3). Label 
this vertex v,—1. Next choose a vertex that is adjacent to either v,, or v;_1 and 
is not V1, V2, Un, OF Un—1. Call this vertex v,—2. We continue this process. Since 


G — {v1,v2} is connected, then for each i € {3,...,n — 1}, there is a ver- 
tex v; © V(G) — {v1, 02, Un, Un—1,---, Vi+1} that is adjacent to at least one of 
Vit 1s+++5 Un 


Now that the vertices are labeled, we can apply the greedy algorithm. Since 
v v2 ¢ E(G), the algorithm will give the color | to both v; and v2. Since each 
Uj, 3 <4 <n, is adjacent to at most k — 1 predecessors, and since v,, is adjacent 
to v; and v2, the algorithm never requires more than k = A(G) colors. Case 3 is 
complete. O 


The next bound involves a new concept. 

The clique number of a graph, denoted by w(G), is defined as the order of 
the largest complete graph that is a subgraph of G.. For example, in Figure 1.91, 
w(G1) = 3 and w(G2) = 4. 


FIGURE 1.91. Graphs with clique numbers 3 and 4, respectively. 


A simple bound that involves clique number follows. We leave it to the reader 
to provide a (one or two line) explanation. 


Theorem 1.44. For any graph G, x(G) > w(G). 


It is natural to wonder whether we might be able to strengthen this theorem and 
prove that y(G') = w(G) for every graph G. Unfortunately, this is false. Consider 
the graph G' shown in Figure 1.92. The clique number of this graph is 5, and the 


FIGURE 1.92. Is x(G) = w(G)? 
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chromatic number is 6 (see Exercise 2). 

The upper and lower bounds given in Theorem 1.45 concern a(G), the inde- 
pendence number of G, defined back in Section 1.4.3. The proofs are left as an 
exercise (see Exercise 6). 


Theorem 1.45. For any graph G of order n, 


n 


a(G) 


<x(G)<n+1-a(G). 


Exercises 


1. Recall that avgdeg(G') denotes the average degree of vertices in G. Prove 
or give a counterexample to the following statement: 


x(G) < 14+ avgdeg(G). 


2. If G is the graph in Figure 1.92, prove that y(G) = 6 and w(G) = 5. 


3. Determine a necessary and sufficient condition for a graph to have a 2- 
colorable line graph. 


4. Recall that 7(G) denotes the number of vertices in a detour path (a longest 
path) of G, prove that y(G) < 7(G). 


5. Prove that the only graph G of order n for which y(G) = nis Ky. 


6. Prove that for any graph G of order n, 


n 


a(G) 


<x(G) <n+1-a(G). 


7. If G is bipartite, prove that w(G) = x(G). 
8. Let G be a graph of order n. Prove that 
(a) n< x(G)x(@); 
(b) 2/n < x(G) + x(G). 


1.6.3. The Four Color Problem 


That doesn’t sound too hard. 
— Princess Leia, Star Wars 


The Four Color Problem. Js it true that the countries on any given map can 
be colored with four or fewer colors in such a way that adjacent countries are 
colored differently? 


94 1. Graph Theory 


The seemingly simple Four Color Problem was introduced in 1852 by Francis 
Guthrie, a student of Augustus DeMorgan. The first written reference to the prob- 
lem is a letter from DeMorgan to Sir William Rowan Hamilton. Despite Hamil- 
ton’s indifference!>, DeMorgan continued to talk about the problem with other 
mathematicians. In the years that followed, many of the world’s top mathematical 
minds attempted either to prove or disprove the conjecture, and in 1879 Alfred 
Kempe announced that he had found a proof. In 1890, however, P. J. Heawood 
discovered an error in Kempe’s proof. Kempe’s work did have some positive fea- 
tures, though, for Heawood made use of Kempe’s ideas to prove that five colors 
always suffice. In this section, we translate the Four Color Problem into a graph 
theory problem, and we prove the Five Color Theorem. 

Any map can be represented by a planar graph in the following way: Repre- 
sent each country on the map by a vertex, and connect two vertices with an edge 
whenever the corresponding countries share a nontrivial border (more than just a 


point). Some examples are shown in Figure 1.93. 


FIGURE 1.93. Graph representations of maps. 


| 


| 


The regions on the map correspond to vertices on the graph, so a graph col- 
oring yields a map coloring with no bordering regions colored the same. This 
natural representation allows us to see that a map is 4-colorable if and only if its 
associated graph is 4-colorable. 

The Four Color Conjecture is equivalent to the following statement. A thorough 
discussion of this equivalence can be found in [52]. 


'5Perhaps he was too busy perfecting plans for a cool new game that he would release a few years 
later. See Section 1.4.3. 
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Theorem 1.46 (Four Color Theorem). Every planar graph is 4-colorable. 


When Heawood pointed out the error in Kempe’s proof, researchers flocked 
back to the drawing board. People worked on the Four Color Problem for years 
and years trying numerous strategies. Finally, in 1976, Kenneth Appel and Wolf- 
gang Haken, with the help of John Koch, announced that they had found a proof 
[12]. To complete their proof, they verified thousands of cases with computers, 
using over 1000 hours of computer time. As you might imagine, people were 
skeptical of this at first. Was this really a proof? How could an argument with so 
many cases be verified? 

While the Appel—Haken proof is accepted as being valid, mathematicians still 
search for alternative proofs. Robertson, Sanders, Seymour, and Thomas [239] 
have probably come the closest to finding a short and clever proof, but theirs still 
requires a number of computer calculations. 

In a 1998 article [267], Robin Thomas said the following. 


For the purposes of this survey, let me telescope the difficulties with 
the A&H proof into two points: (1) part of the proof uses a computer 
and cannot be verified by hand, and (2) even the part that is suppos- 
edly hand-checkable has not, as far as I know, been independently 
verified in its entirety. ... Neil Robertson, Daniel P. Sanders, Paul 
Seymour, and I tried to verify the Appel—Haken proof, but soon gave 
up and decided that it would be more profitable to work out our own 
proof. ... We were not able to eliminate reason (1), but we managed 
to make progress toward (2). 


As mentioned earlier, Heawood [156] provided a proof of the Five Color 
Theorem in the late 1890s, and we present his proof here. Some of the ideas 
in his proof came from Kempe’s attempt [174] to solve the Four Color Problem. 


Theorem 1.47 (Five Color Theorem). Every planar graph is 5-colorable. 


Proof. We induct on the order of G. Let G' be a planar graph of order n. If n < 5, 
then the result is clear. So suppose that n > 6 and that the result is true for all 
planar graphs of order n — 1. From Theorem 1.35, we know that G' contains a 
vertex, say v, having deg(v) < 5. 

Consider the graph G’ obtained by removing from G the vertex v and all edges 
incident with v. Since the order of G’ is n — 1 (and since G’ is of course planar), 
we can apply the induction hypothesis and conclude that G’ is 5-colorable. Now, 
we can assume that G’ has been colored using the five colors, named 1, 2, 3, 4, 
and 5. Consider now the neighbors of v in G. As noted earlier, v has at most five 
neighbors in G’,, and all of these neighbors are vertices in (the already colored) G’. 

If in G’ fewer than five colors were used to color these neighbors, then we can 
properly color G by using the coloring for G’ on all vertices other than v, and by 
coloring v with one of the colors that is not used on the neighbors of v. In doing 
this, we have produced a 5-coloring for G. 
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So, assume that in G’ exactly five of the colors were used to color the neighbors 
of v. This implies that there are exactly five neighbors, call them wy , we, w3, wa, 
ws, and assume without loss of generality that each w, is colored with color 7 (see 
Figure 1.94). 


Wi 
5 2, 
v 
Ws W2 
W4 W3 
4 3 
FIGURE 1.94. 


We wish to rearrange the colors of G’ so that we make a color available for 
v. Consider all of the vertices of G’ that have been colored with color | or with 
color 3. 

Case 1. Suppose that in G’ there does not exist a path from w; to ws where all 
of the colors on the path are | or 3. Define a subgraph H of G’ to be the union of 
all paths that start at w; and that are colored with either 1 or 3. Note that w3 is not 
a vertex of H and that none of the neighbors of w3 are in H (see Figure 1.95). 


FIGURE 1.95. 


Now, interchange the colors in H. That is, change all of the 1’s into 3’s and 
all of the 3’s into 1’s. The resulting coloring of the vertices of G’ is a proper 
coloring, because no problems could have possibly arisen in this interchange. We 
now see that w, is colored 3, and thus color | is available to use for v. Thus, G is 
5-colorable. 

Case 2. Suppose that in G’ there does exist a path from wy, to w3 where all of 
the colors on the path are | or 3. Call this path P. Note now that P along with v 
forms a cycle that encloses either wz or w4 (Figure 1.96). 
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or 


FIGURE 1.96. Two possibilities. 


So there does not exist a path from w2 to w4 where all of the colors on the 
path are 2 or 4. Thus, the reasoning in Case | applies! We conclude that G is 
5-colorable. O 


Exercises 


1. Determine the chromatic number of the graph of the map of the United 
States. 


2. Determine the chromatic number of the graph of the map of the countries 
of South America. 


3. Determine the chromatic number of the graph of the map of the countries 
of Africa. 


4. Determine the chromatic number of the graph of the map of the countries 
of Australia. Hint: This graph will be quite small! 


5. Where does the proof of the Five Color Theorem go wrong for four colors? 


1.6.4 Chromatic Polynomials 


Everything should be made as simple as possible, but not simpler. 
— Albert Einstein 


Chromatic polynomials, developed by Birkhoff in the early 1900s as he studied 
the Four Color Problem, provide us with a method of counting the number of 
different colorings of a graph. 

Before we introduce the polynomials, we should clarify what we mean by dif- 
ferent colorings. Given a graph G, suppose that its vertices are labeled v1, ve, 
... Un. A coloring of G is an assignment of colors to these vertices, and we call 
two colorings C', and C2 different if at least one v; receives a different color in 
C; than it does in C2. For instance, the two colorings of K’4 shown in Figure 1.97 
are considered different, since v3 and v4 receive different colorings. 
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red blue red blue 
Vy Vy Vi Vy 
green white white green 


FIGURE 1.97. Two different colorings. 


If we restrict ourselves to four colors, how many different colorings are there 
of K4? Since there are four choices for v;, then three for v2, etc., we see that 
there are 4. 3-2-1 different colorings of K’4 using four colors. If six colors were 
available, there would be 6-5-4-3 different colorings. If only two were available, 
there would be no proper colorings of K4. 

In general, define cg(k) to be the number of different colorings of a graph G 
using at most k colors. So we have cx, (4) = 24, cx, (6) = 360, and cx, (2) = 0. 
In fact, if / and n are positive integers where k > n, then 


cx, (k) = k(k — I(k—2)-+-(kK-—n +1). 


Further, if k < n, then cx, (k) = 0. We also note that cp, (k) = k” for all 
positive integers k and n. 

A simple but important property of c¢(k) is that G' is k-colorable if and only if 
cq(k) > 0. Equivalently, cg(k) > 0 if and only if x(G) < k. 

Finding values of cq(k) is relatively easy for some well-known graphs. Com- 
puting this function in general, though, can be hard. Birkhoff and Lewis [27] 
developed a way to reduce this hard problem to an easier one. Before we see their 
method, we need a definition. 

Let G be a graph and let e be an edge of G. Recall that G — e denotes the graph 
where e is removed from G. Define the graph G/e to be the graph obtained from 
G by removing e, identifying the end vertices of e, and leaving only one copy of 
any resulting multiple edges. 

As an example, a graph G and the graphs G — bc and G/bc are shown in Fig- 
ure 1.98. 


Theorem 1.48. Let G be a graph and e be any edge of G. Then 
ca(k) = cg—e(k) — cg/e(k). 


Proof. Suppose that the end vertices of e are u and v, and consider the graph 
Ge. 

How many k-colorings are there of G — e where u and v are assigned the same 
color? If C is a such a coloring of G'— e, then C can be thought of as a coloring of 
G'/e, since u and v are colored the same. Similarly, any coloring of G/e can also 
be thought of as a coloring of G — e where wu and v are colored the same. Thus, 
the answer to this question is cg/.(k). 
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FIGURE 1.98. Examples of the operations. 


Now, how many k-colorings are there of G' — e where wu and v are assigned 
different colors? If C' is a such a coloring of G — e, then C' can be considered as a 
coloring of G, since u and v are colored differently. Similarly, any coloring of G 
can also be thought of as a coloring of G—e where u and v are colored differently. 
Thus, the answer to this second question is cg(k). 

Thus, the total number of k-colorings of G — e¢ is 


ca-e(k) = ca/e(k) + ca(k), 
and the result follows. O 


For example, suppose we want to find cp, (k). That is, how many ways are there 
to color the vertices of P, with k colors available? We label the edges of P, as 
shown in Figure 1.99. 


O——-——- 0. 
P, 
e; 5) 3 
FIGURE 1.99. The labeled edges of P4. 


The theorem implies that 


CPy (k) — CPy—e1 (k) — CPy/ey (k). 


For convenience, let us denote P, — e; and P,/e; by Gy, and Gig, respectively 
(see Figure 1.100). 


G, e e——_e—_e G,, e——_e——_e 
2) €3 3) 3 


FIGURE 1.100. The first application. 
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Applying the theorem again, we obtain 


cp,(k) = CG@11-e2 (k) — C41 /e2 (k) — CG@i2-e2 (k) + CG12/e2 (k). 


Denote the graphs G11 — €2, Gis /e2, Gig — e2, and G12/e by Gai, Go2, Gos, 
and Gq, respectively (see Figure 1.101). 


G,, @ ° o——o G,, @ o——o 
e3 e3 
Gy e e Gy, e 
3 3 


FIGURE 1.101. The second application. 


Applying the theorem once more yields 
CPy (k) = exe, (Kh) — CGo1/e3 (k) — CGao—es (kK) + CG 22 /e3 (k) 
— CG@23-e3 (k) 7 CGa3 /eg(*) 1 CGo4—e3(k) — CGo4/es (k). 

That is, 

cp,(k) = CE,(k)—Cp,(k) cx, (k)+cx,(k) —cr, (hk) +en, (k)+cx,(k)—cr, (k). 
Thus, 

cp,(k) =k -RP-RP4+RP-—R +h +h Kk 
= k* — 3h? + 3k? —k. 


We should check a couple of examples. How many colorings of P, are there 
with one color? 
ép,(1) = 1* —3(1)? + 3(1)? -1=0. 


This, of course, makes sense. And how many colorings are there with two colors? 


ep, (2) = 2* — 3(2)? + 3(2)? —2 =2. 
Figure 1.102 shows these two colorings. Score one for Birkhoff! 


red _— blue red __ blue blue red blue red 
o—_o>—___o——_0 —————_o—_—_9 


FIGURE 1.102. Two 2-colorings of P4. 


As you can see, chromatic polynomials provide a way to count colorings, and 
the Birkhoff—Lewis theorem allows you to reduce a problem to a slightly simpler 
one. We should note that it is not always necessary to work all the way down to 
empty graphs, as we did in the previous example. Once a graph G is obtained for 
which the value of cg(k) is known, there is no need to reduce that one further. 

We now present some properties of cg(k). 
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Theorem 1.49. Let G be a graph of order n. Then 


1. 


2. 


cqa(k) is a polynomial in k of degree n, 


the leading coefficient of cg(k) is 1, 


. the constant term of cg(k) is 0, 
. the coefficients of cg(k) alternate in sign, and 


. the absolute value of the coefficient of the k"~' term is the number of edges 


in G. 


We leave the proof of this theorem as an exercise (Exercise 3). One proof strat- 
egy is to induct on the number of edges in G' and use the Birkhoff—Lewis reduction 
theorem (Theorem 1.48). 

Before leaving this section, we should note that Birkhoff considered chromatic 
polynomials of planar graphs, and he hoped to find one of them that had 4 as a 
root. If he had found one, then the corresponding planar graph would not be 4- 
colorable, and hence would be a counterexample to the Four Color Conjecture. 
Although he was unsuccessful in proving the Four Color Theorem, he still de- 
serves credit for producing a very nice counting technique. 


Exercises 


1. 


Find chromatic polynomials for each of the following graphs. For each one, 
determine how many 5-colorings exist. 

(a) K1,3 

(b) Ai 5 

(c) C4 

(d) Cs 

(e) K4—e 

(f) Ks —e 


2. Show that k* — 4k? + 3k? is not a chromatic polynomial for any graph. 


3. 


Prove Theorem 1.49. 


4. Determine the chromatic polynomial for a tree of order n. 


1.7 


Matchings 


Pardon me, do you have a match? 
— James Bond, in From Russia with Love 
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The Senate committees that we discussed earlier need to form an executive coun- 
cil. Each committee needs to designate one of its members as an official repre- 
sentative to sit on the council, and council policy states that no senator can be the 
official representative for more than one committee. For example, let us suppose 
there are four committees: Senators A, B, C, and D are on Committee 1; Senators 
A, EL, and F are members of Committee 2; Committee 3 consists of L', F', and D; 
and Senator A is the only member of Committee 4. In this example, the executive 
council could consist of A, FE, F’, and C—representing Committees 4, 3, 2, and 
1, respectively. 

As another example, suppose Committee 1 consists of W, X and Y; Commit- 
tee 2 of W, X, and Z; Committee 3 of W, Y, and Z; Committee 4 of X, Y, and 
Z; and Committee 5 of W and Z. It does not take long to see that it is impossible 
in this case to select official representatives according to the policy. 

So a natural question arises: Under what circumstances can the executive coun- 
cil be formed successfully? In the sections that follow, we will see how graphs 
can be used to help answer this question. 


1.7.1 Definitions 


And as to the meaning... 
—C.S. Calverly, Ballad 


A matching in a graph is a set of independent edges. That is, it is a set of edges 
in which no pair shares a vertex. Given a matching // in a graph G, the vertices 
belonging to the edges of Mare said to be saturated by M (or M-saturated). The 
other vertices are M/-unsaturated. 

Consider the graph G' shown in Figure 1.103. An example of a matching in G 
is M, = {ab, ce, df}. Mz = {cd, ab} is also a matching, and so is M3 = {df}. 
We can see that a, b, c, d are M9-saturated and e, f, and g are M/2-unsaturated. 
The only (/;-unsaturated vertex is g. 


FIGURE 1.103. The matching M4. 


If a matching M saturates every vertex of G, then M is said to be a perfect 
matching. In Figure 1.104, G; has a perfect matching, namely {ab, ch, de, fg}. 
None of G2, G3, and G'4 has a perfect matching. Why is this? We will talk more 
about perfect matchings in Section 1.7.4. 

A maximum matching in a graph is a matching that has the largest possible 
cardinality. A maximal matching is a matching that cannot be enlarged by the 
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G, 


@—_e—__e—_e—_ 


FIGURE 1.104. Only G; has a perfect matching. 


addition of any edge. In Figure 1.105, M; = {ae,bf,cd,gh} is a maximum 
matching (since at most one of gh, gi, and gj can be in any matching). The match- 
ing Mp = {dg, af, bc} is maximal, but not maximum. 


FIGURE 1.105. 


Exercises 


1. Determine whether the graph of Figure 1.106 has a perfect matching. If so, 
then exhibit it. If not, explain why. 


FIGURE 1.106. Is there a perfect matching? 


2. Find the minimum size of a maximal matching in each of the following 
graphs. 


(a) Cio 
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(b) Ci, 
(c) Ch 


3. (From [52].) The matching graph M(G) of a graph G has the maximum 
matchings of G as its vertices, and two vertices MM, and Mz of M(G) are 
adjacent if Md, and Mg differ in only one edge. Show that each cycle C,,, 
n = 3,4, 5, or 6, is the matching graph of some graph. 


1.7.2. Hall’s Theorem and SDRs 


I'll match that! 
— Monty Hall, Let’s Make a Deal 


In this section we consider several classic results concerning matchings. We begin 
with a few more definitions. 

Given a graph G and a matching /, an M-alternating path is a path in G where 
the edges alternate between J/-edges and non-M-edges. An M-augmenting path 
is an \/-alternating path where both end vertices are /-unsaturated. 

As an example, consider the graph G and the matching / indicated in Fig- 
ure 1.107. An example of an M-alternating path is c, a, d, e, 7. An example of 
an M-augmenting path is 7, g, f, a, c, b. The reason for calling such a path “/- 
augmenting” will become apparent soon. 


J 
FIGURE 1.107. The graph G and matching M. 


The following result is due to Berge [23]. 


Theorem 1.50 (Berge’s Theorem). Let M be a matching in a graph G. M is 
maximum if and only if G contains no M -augmenting paths. 


Proof. First, assume that MM is a maximum matching, and suppose that P : 
U1, U2,--.,Uz iS an M-augmenting path. Due to the alternating nature of M/- 
augmenting paths, it must be that k is even and that the edges v2v3, vaus, ...; 
UR—2UK—1 are all edges of IW. We also see that the edges v1 v2, 34, ..-, Uk—1Uk 
are not edges of MM (Figure 1.108). 


But then if we define the set of edges MM, to be 


My, = (M \ {v2u3,..., Uk—-2Up—1}) U {v102,..-,UR—-1Uk}, 
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Vi Vo V3 Va Vs Vo Ve-2 Ve-1 Ve 


FIGURE 1.108. An M-augmenting path. 


then /, is a matching that contains one more edge than 7, a matching that we 
assumed to be maximum. This is a contradiction, and we can conclude that G 
contains no //-augmenting paths. 

For the converse, assume that G' has no 1/-augmenting paths, and suppose that 
M’ is a matching that is larger than 1/7. Define a subgraph H of G as follows: Let 
V(H) = V(G) and let E(/) be the set of edges of G that appear in exactly one 
of M and M’. Now consider some properties of this subgraph H. Since each of 
the vertices of G lies on at most one edge from / and at most one edge from MM’, 
it must be that the degree (in H) of each vertex of H is at most 2. This implies 
that each connected component of H is either a single vertex, a path, or a cycle. 
If a component is a cycle, then it must be an even cycle, since the edges alternate 
between M-edges and M’-edges. So, since |/’| > |], there must be at least 
one component of H that is a path that begins and ends with edges from MM’. But 
this path is an /-augmenting path, contradicting our assumption. Therefore, no 
such matching M’ can exist—implying that IV is maximum. O 


Before we see Hall’s classic matching theorem, we need to define one more 
term. If G is a bipartite graph with partite sets X and Y, we say that X can be 
matched into Y if there exists a matching in G that saturates the vertices of X. 

Consider the two examples in Figure 1.109. In the bipartite graph on the left, 


xX ¥ xX a 


FIGURE 1.109. 


we see that X can be matched into Y. In the graph on the right, though, it is 
impossible to match X into Y (why is this?). What conditions on a bipartite graph 
must exist if we want to match one partite set into the other? The answer to this 
question is found in the following result of Hall [147] (Philip, not Monty). 

Recall that the neighborhood of a set of vertices S, denoted by N(.S), is the 
union of the neighborhoods of the vertices of S. 
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Theorem 1.51 (Hall’s Theorem). Let G be a bipartite graph with partite sets X 
and Y. X can be matched into Y if and only if |N(S)| > |S| for all subsets S of 
Xx. 


Proof. First suppose that X can be matched into Y, and let S' be some subset of 
X. Since S itself is also matched into Y, we see immediately that |S] < |N(S)| 
(see Figure 1.110). Now suppose that |N(S)| > |S| for all subsets S of X, and 


FIGURE 1.110. 


let 7 be a maximum matching. Suppose that w € X is not saturated by M (see 
Figure 1.111). Define the set A to be the set of vertices of G that can be joined to u 


X Y 


FIGURE 1.111. 


by an M-alternating path. Let S = AMX, and let T = ANY (see Figure 1.112). 
Notice now that Berge’s Theorem implies that every vertex of T' is saturated by 


(0 
X Y 


FIGURE 1.112. 
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M and that wu is the only unsaturated vertex of S. That is, every vertex of T is 
saturated, and every vertex of S \ {u} is saturated. This implies that |T| = |S|—1. 
It follows from Berge’s Theorem and the definition of T that V(.S) = 7. But then 
we have that |V(S')| = |S| — 1 < |S|, and this is a contradiction. We conclude 
that such a vertex u cannot exist in _X and that M saturates all of X. O 


Given some family of sets X, a system of distinct representatives, or SDR, for 
the sets in X can be thought of as a “representative” collection of distinct elements 
from the sets of X. For instance, let 51, S2, $3, S4, and Ss be defined as follows: 


S1= {2,8}, 
So = {8}, 
S3= {5,7}, 
S4 = {2,4, 8}, 
Ss = {2,4}. 


The family X, = {$1, S2,.53,.54} does have an SDR, namely {2,8, 7,4}. The 
family Xp = {.$), S2, S41, .S5} does not have an SDR. 

So under what conditions will a finite family of sets have an SDR? We answer 
this question with the following theorem. 


Theorem 1.52. Let 51, So, ..., Sx be a collection of finite, nonempty sets. This 
collection has an SDR if and only if for every t € {1,...,k}, the union of any t 
of these sets contains at least t elements. 


Proof. Since each of the sets is finite, then of course S = S; US U---U Sx is 
finite. Let us say that the elements of S are a1, ..., Gn. 

We now construct a bipartite graph with partite sets X = {.S),...,S,} and 
Y = {a1,...,@} (Figure 1.113). We place an edge between 5; and a, if and 
only if a; € Sj. 


FIGURE 1.113. Constructing a bipartite graph. 


Hall’s Theorem now implies that X can be matched into Y if and only if | A| < 
|N(A)| for all subsets A of X. In other words, the collection of sets has an SDR 
if and only if for every t € {1,...,k}, the union of any t of these sets contains at 
least t elements. O 
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Hall’s Theorem is often referred to as Hall’s Marriage Theorem. We will see 
more about this in Section 2.9. 


Exercises 


1. (From [56].) For the graphs of Figure 1.114, with matchings as shaded, 
find 
(a) an /-alternating path that is not (/-augmenting; 


(b) an M-augmenting path if one exists; and, if so, use it to obtain a bigger 
matching. 


FIGURE 1.114. 


2. For each of the following families of sets, determine whether the condition 
of Theorem 1.52 is met. If so, then find an SDR. If not, then show how the 
condition is violated. 

(a) {1,2, 3}, {2,3,4}, {3,4, 5}, {4,5}, {1, 2,5} 

(b) {1,2,4}, {2,4}, {2, 3}, {1,2, 3} 

(c) {1,2}, {2,3}, {1, 2,3}, {2, 3, 4}, {1, 3}, {3, 4} 
(d) {1,2, 5}, {1,5}, {1,2}, {2,5} 

(e) {1,2, 3}, {1, 2,4}, {1,3, 4}, {1, 2,3, 4}, {2,3, 4} 


3. Let G be a bipartite graph. Show that G has a matching of size at least 
|E(G)|/A(G). 


4. Let 0 = {S$}, So,...,S;-} be a family of distinct nonempty subsets of the 
set {1,2,...,m}. If the S; are all of the same cardinality, then prove that 
there exists an SDR of O. 
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5. Let My, and Mp be matchings in a bipartite graph G with partite sets X and 
Y.If S C X is saturated by MW, and T C Y is saturated by M2, show that 
there exists a matching in G that saturates S UT. 


6. (From [139].) Let G be a bipartite graph with partite sets X and Y. Let dx 
denote the minimum degree of the vertices in X, and let Ay denote the 
maximum degree of the vertices in Y. Prove that if dx > Ay, then there 
exists a matching in G' that saturates X. 


1.7.3 The Kénig—Egervary Theorem 


What I tell you three times is true. 
— Lewis Carroll, The Hunting of the Snark 


The main theorem that we present in this section is very important, for it is closely 
related to several results from other areas of graph theory. We will discuss a few 
of these areas after we have proven the theorem. 

A set C of vertices in a graph G is said to cover the edges of G if every edge of 
G is incident with at least one vertex of C’. Such a set C is called an edge cover 
of G. 

Consider the graphs G and G2 in Figure 1.115. In G4, the set {b, d, e, a} is 
an edge cover, as is the set {a, e, f}. In fact, you can see by a little examination 
that there is no edge cover G; with fewer than three vertices. So we can say that 
{a,e, f} is a minimum edge cover of G}. In G2, each of the following sets is an 
edge cover: {v1, U2, U3, U4, Us, Us } (obviously) and {uz, vg, ui}. What is the size 
of a minimum edge cover here? 


a b VY) V2 V3 V4 V5 (VG 
ft uy uy U3 U4 
FIGURE 1.115. 


We are now ready to prove the following result of Konig [180] and Egervary 
[87]. 


Theorem 1.53 (K6nig—Egervary Theorem). Let G be a bipartite graph. The max- 
imum number of edges in a matching in G equals the minimum number of vertices 
in an edge cover of G. 


Proof. Let M be a maximum matching of G. Let X and Y be the partite sets of 
G, and let W be the set of all M/-unsaturated vertices of X (see Figure 1.116). 
Note that |M| = |X| — |W]. 
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W 
xX Y 
FIGURE 1.116. 


Now let A be the set of vertices of G that can be reached via an /-alternating 
path from some vertex of W. Let S = AN X, and let T = AMY. We can note 
two things now: First, S \ W is matched to T (implying that |W| = |S| — |T]), 
and second, N(S') = T. 

If we let C = (X \ S) UT, we see that C’ covers the edges of G. So C is an 
edge cover of G, and |C| = |X| — |S| + |T| = |X| — |W| = |M]. Now suppose 
that C’ is any edge cover. Since each vertex of C’ can cover at most one edge 
of M, it must be that |C’| > |M]|. We conclude then that C' is a minimum edge 
cover. O 


The K6nig—Egervary Theorem is one of several theorems in graph theory that 
relate the minimum of one thing to the maximum of something else. What fol- 
lows are some examples of theorems that are very closely related to the K6nig— 
Egervary Theorem. 


Menger’s Theorem 


Let G be a connected graph, and let wu and v be vertices of G. If S is a subset 
of vertices that does not include u or v, and if the graph G — S' has wu and v in 
different connected components, then we say that S is a u, v-separating set. 

The following result is known as Menger’s Theorem [202]. 


Theorem 1.54. Let G be a graph and let u and v be vertices of G. The maximum 
number of internally disjoint paths from u to v equals the minimum number of 
vertices in a u, v-separating set. 


Max Flow Min Cut Theorem 


A graph can be thought of as a flow network, where one vertex is specified to be 
the source of the flow and another is specified to be the receiver of the flow. As 
an amount of material flows from source to receiver, it passes through other inter- 
mediate vertices, each of which has a particular flow capacity. The total flow of a 
network is the amount of material that is able to make it from source to receiver. 
A cut in a network is a set of intermediate vertices whose removal completely cuts 
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the flow from the source to the receiver. The capacity of the cut is simply the sum 
of the capacities of the vertices in the cut. 


Theorem 1.55. Let N be a flow network. The maximum value of total flow equals 
the minimum capacity of a cut. 


Independent Zeros 


If A is an m x n matrix with real entries, a set of independent zeros in A can 
be thought of as a set of ordered pairs {(71, 71), (i2,J2),---, (44, J) } with the 
following properties: 


a. the (ix, j,) entry of Ais 0 fork = 1,2,...,¢; 
b. ifa # b, then ig A iy and ja F Jo. 


That is, none of the zeros in the set are in the same row or column. 
Now, in this matrix A one can draw lines through each row and column that 
contains a zero. Such a set of lines is said to cover the zeros of A. 


Theorem 1.56. The maximum number of independent zeros in A is equal to the 
minimum number of lines through rows or columns that together cover all the 
zeros of A. 


Exercises 


1. Use the Konig—Egervary Theorem to prove Hall’s Theorem. 


2. Let k be some fixed integer, 1 < k < n, and let G be some subgraph of 
Ky» with more than (& — 1)n edges. Prove that G has a matching of size 
at least k. 


3. Use the original statement of the K6nig—Egervary Theorem to prove Theo- 
rem 1.56. 


1.7.4 Perfect Matchings 


It’s a perfect ending. 
— Sophie, in Anastasia 


We end this section on matchings by discussing perfect matchings. Recall that a 
perfect matching is a matching that saturates the entire vertex set of a graph. What 
kinds of graphs have perfect matchings? One thing that is clear is that a graph must 
be of even order in order to have a chance at having a perfect matching. But being 
of even order is certainly not enough to guarantee a perfect matching (look back 
at Figure 1.105). 

We do know that K2,, C2,, and P2,, have perfect matchings. The following 
result regarding perfect matchings in bipartite graphs is a corollary to Hall’s The- 
orem. The proof is left as an exercise (Exercise 5). 
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Corollary 1.57. If G is a bipartite graph that is regular of degree k, then G 
contains a perfect matching. 


It seems very natural to think that the more edges a graph has, the more likely 
it is that the graph will have a perfect matching. The following theorem verifies 
this thought, to a degree. 


Theorem 1.58. If G is a graph of order 2n such that 6(G) > n, then G has a 
perfect matching. 


Proof. Let G be a graph of order 2n with 6(G) > n. Dirac’s theorem (Theo- 
rem 1.22) guarantees the existence of a Hamiltonian cycle, C’. A perfect matching 
of G is formed by using alternate edges of C. O 


In 1947 Tutte [269] provided perhaps the best known characterization of graphs 
with perfect matchings. A number of proofs of Tutte’s Theorem have been pub- 
lished since then. The proof that we present is due to Anderson [7]. 

A definition first: Given a graph G, define 2.(G) to be the number of connected 
components of G with odd order. Also, define 4(G) to be the number of con- 
nected components of G with even order. 


Theorem 1.59 (Tutte’s Theorem). Let G be a graph of order n > 2. G has a 
perfect matching if and only if Q(G — S) < |S| for all subsets of S of V(G). 


Proof. We begin with the forward direction. Let G be a graph that has a perfect 
matching. Suppose S is a set of vertices and that O,, O2,..., Ox are the odd 
components of G' — S. For each 3, the vertices in O; can be adjacent only to other 
vertices in O; and to vertices in S. Since G has a perfect matching, at least one 
vertex out of each of the O;’s has to be matched with a different vertex in S. If 
k, > |S|, then some O; would be left out (Figure 1.117). Thus, & < |S]. 


FIGURE 1.117. 


For the reverse direction of the theorem, suppose that |S] > Q(G — S) for all 
S. In particular, if S = 0, then Q(G — 0) < 0. This implies that there are no odd 
components of G—every component of G is even. More generally, we make the 
following claim. 
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Claim A. For any proper subset S,, |.S| and Q(G — S) are either both even 
or both odd. 


Let C be some component of G’. We know from earlier that C’ has even order. If an 
even number of vertices is removed from C’, then the number of odd components 
remaining must also be even. If an odd number of vertices is removed from C,, 
then the number of odd components remaining must be odd. Since this is true for 
every component of G,, it is true for all of G. Hence Claim A is proved. 

We now proceed by induction on n, the order of the graph. If n = 2, then G 
is Kz, which certainly has a perfect matching. Suppose now that the result is true 
for all graphs of even order up to n, and let G be a graph of even order n. We now 
have two cases. 

Case 1. Suppose that for every proper subset 5, Q(G — 8) < |S]. (That is, the 
strict inequality holds.) Claim A implies that |.S| and Q(G — S) have the same 
parity, so we can say in this case that for all subsets S, Q(G — S) < |S| — 2. Let 
uv © E(G), and consider the graph G — u — v (a graph with two fewer vertices 
than G). We would like to apply the induction hypothesis to G — u — v, so we 
need the following claim. 


Claim B. For all subsets S’ of V(G — u— v), O(G —u—v—S") < |S". 


If Claim B were not true, then 2(G — u—v— 5S) > |S| for some subset S). But 
since |.S)| = |S, U {u, v}| — 2, we get Q(G — u— v — $1) > |S, U {u, v}], and 
this contradicts the assumption in this case. Claim B is proved. 

Since Claim B is true, we can apply the induction hypothesis to G — u — v. 
That is, we can conclude that G' — u — v has a perfect matching. This matching, 
together with the edge wv, forms a perfect matching of G. Case | is complete. 

Case 2. Suppose there exists a subset S' such that Q(G — S') = |S|. There 
may be a number of subsets S' that satisfy this condition—suppose without loss 
of generality that S is a largest such set. Let Oi, Oo, ..., Ox be the components 
of G — S of odd order. 


Claim C. 5(G — S) = 0. That is, there are no even-ordered components of 
G—S. 


Let E be an even ordered component of G — S, and let x be a vertex of E. 
The graph G — S — x has exactly one more odd component than G — S. Thus, 
|S U {x}| = O(G — S — x). But this means that S U {2} is a set larger than S' 
that satisfies the assumption of this case. Since we chose S to be the largest, we 
have a contradiction. Therefore there are no even-ordered components of G — S. 
Claim C is proved. 


Claim D. There exist vertices 51, 52,..., 5% € Sand vertices v1, v2,..., Uk, 
where for each i uv; € O;, such that {v1 51, v252,..., x5, } is a matching. 


For each i € {1,...,k}, define the set S; to be the set of vertices in S that 
are adjacent to some vertex in O;. Note that if S; = @ for some i, then O; is 
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completely disconnected from anything else in G, implying that G itself has an 
odd component. Since this contradicts our assumption in this case, we can assume 
that each S; is nonempty. Furthermore, our initial assumptions imply that the 
union of any r of the S;’s has size at least r. Thus, the condition in Theorem 1.52 
is satisfied, implying that there exists a system of distinct representatives for the 
family of sets S), So, ..., Sx. If we let these representatives be s1, so,..., Sk, 
and their adjacencies in the O,’s be v1, v2, ..., Ug, then Claim D is proved. 
The situation in G is depicted in Figure 1.118, where & = |S]. 


0; 


O; 


Or 
FIGURE 1.118. 


At this point, each vertex in S' has been matched to a vertex in an O;. The goal 
at this point is to show that each O; — v; has a perfect matching. 
Let W be some subset of vertices of (the even-ordered) O; — v;. 


If Q(O; =U = W) > |W 
then, 


, then, by Claim A, 2(O; — v; — W) > |W] + 2. But 


O(G —S—v,—-W) > |S|-—1+|W|4+2 
= |S|+|W|+1 
= |SUWU {u;}|. 
This contradicts our assumption, and thus Claim E is proved. 
Since Claim E is true, each O; — v; satisfies the induction hypothesis, and 
thus has a perfect matching. These perfect matchings together with the perfect 


matching shown in Figure 1.118 form a perfect matching of G, and so Case 2 is 
complete. O 


We conclude this section by considering perfect matchings in regular graphs. If 
a graph G is 1-regular, then G itself is a perfect matching. If G' is 2-regular, then 
G is a collection of disjoint cycles; as long as each cycle is even, G will have a 
perfect matching. 

What about 3-regular graphs? A graph that is 3-regular must be of even order, 
so is it possible that every 3-regular graph contains a perfect matching? In a word, 
no. The graph in Figure 1.119 is a connected 3-regular graph that does not have a 
perfect matching. Thanks to Petersen [221], though, we do know of a special class 
of 3-regular graphs that do have perfect matchings. Recall that a bridge in a graph 
is an edge whose removal would disconnect the graph. The graph in Figure 1.119 
has three bridges. 
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FIGURE 1.119. 


Theorem 1.60 (Petersen’s Theorem). Every bridgeless, 3-regular graph contains 
a perfect matching. 


Proof. Let G' be a bridgeless, 3-regular graph, and suppose that it does not contain 
a perfect matching. By Tutte’s Theorem, there must exist a subset S' of vertices 
where the number of odd components of G — S is greater than |.S'|. Denote the 
odd-ordered components of G — S' by Oj, Og, ..., Ox. 

First, each O; must have at least one edge into S. Otherwise, there would exist 
an odd-ordered, 3-regular subgraph of G, and this is not possible, by Theorem 1.1. 
Second, since G is bridgeless, there must be at least two edges joining each O; to 
S. Moreover, if there were only two edges joining some O; to 5’, then O; would 
contain an odd number of vertices with odd degree, and this cannot happen. 

We can therefore conclude that there are at least three edges joining each O; to 
S. This implies that there are at least 3k edges coming into S from the O,’s. But 
since every vertex of S has degree 3, the greatest number of edges incident with 
vertices in S is 3|.S|, and since 3k > 3|.S|, we have a contradiction. Therefore, G 
must have a perfect matching. O 


It is probably not surprising that the Petersen of Theorem 1.60 is the same 
person for whom the Petersen graph (Figure 1.63) is named. 

Petersen used this special graph as an example of a 3-regular, bridgeless graph 
whose edges cannot be partitioned into three separate, disjoint matchings. 


Exercises 


1. Find a maximum matching of the graph shown in Figure 1.119. 


2. Use Tutte’s Theorem to prove that the graph in Figure 1.119 does not have 
a perfect matching. 


3. Draw a connected, 3-regular graph that has both a cut vertex and a perfect 
matching. 
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. Determine how many different perfect matchings there are in Kp». 
. Prove Corollary 1.57. 


. Characterize when K;., .,....r, has a perfect matching. 


eats 


. Prove that every tree has at most one perfect matching. 


oN Dn Wn fF 


. Let G be a subgraph of K20,20. If G has a perfect matching, prove that G 
has at most 190 edges that belong to no perfect matching. 


9. Use Tutte’s Theorem to prove Hall’s Theorem. 


1.8 Ramsey Theory 


I have to go and see some friends of mine, some that I don’t know, 
and some who aren’t familiar with my name. 
— John Denver, Goodbye Again 


We begin this section with a simple question: How many people are required at 
a gathering so that there must exist either three mutual acquaintances or three 
mutual strangers? We will answer this question soon. 

Ramsey theory is named for Frank Ramsey, a young man who was especially 
interested in logic and philosophy. Ramsey died at the age of 26 in 1930—the 
same year that his paper On a problem of formal logic was published. His pa- 
per catalyzed the development of the mathematical field now known as Ramsey 
theory. The study of Ramsey theory has burgeoned since that time. While many 
results in the subject are published each year, there are many questions whose 
answers remain elusive. As the authors of [136] put it, “the field is alive and ex- 
citing.” 


1.8.1 Classical Ramsey Numbers 


An innocent looking problem often gives no hint as to its true nature. 
— Paul Erdés [92] 


A 2-coloring of the edges of a graph G is any assignment of one of two colors to 
each of the edges of G. Figure 1.120 shows a 2-coloring of the edges of A’; using 
red (thick) and blue (thin). 

Let p and q be positive integers. The (classical) Ramsey number associated with 
these integers, denoted by R(p, q), is defined to be the smallest integer n such that 
every 2-coloring of the edges of K,, either contains a red K, or a blue Ky asa 
subgraph. 

Read through that definition at least one more time, and then consider this sim- 
ple example. We would like to find the value of R(1,3). According to the defi- 
nition, this is the least value of n such that every 2-coloring of the edges of K;, 
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FIGURE 1.120. A 2-coloring of Ks. 


either contains as a subgraph a Ky all of whose edges are red, or a K3 all of 
whose edges are blue. How many vertices are required before we know that we 
will have one of these objects in every coloring of a complete graph? If you have 
just one vertex, then no matter how you color the edges (ha-ha) of /;, you will 
always end up with ared A. Thus, R(1,3) = 1. We have found our first Ramsey 
number! 

We should note here that the definition given for Ramsey number is in fact 
a good definition. That is, given positive integers p and g, R(p,q) does in fact 
exist. Ramsey himself proved this fact, and we will learn more about the proof of 
“Ramsey’s Theorem” in Chapter 2. 

Back to examples. We just showed that R(1,3) = 1. Similar reasoning shows 
that R(1,k) = 1 for all positive integers k (see Exercise 2). 

How about (2, 4)? We need to know the smallest integer n such that every 2- 
coloring of the edges of K,,, contains either a red ‘2 or a blue ‘4. We claim that 
R(2,4) = 4. To show this, we must demonstrate two things: first, that there exists 
a 2-coloring of ‘3 that contains neither a red K‘2 nor a blue K4, and second, that 
any 2-coloring of the edges of /’4 contains at least one of these as a subgraph. 

We demonstrate the first point. Consider the 2-coloring of K3 given in Fig- 
ure 1.121 (recall that red is thick and blue is thin—the edges in this coloring are 
all blue). This coloring of /3 does not contain a red Ko, and it certainly does not 


FIGURE 1.121. The edges of K3 colored blue. 


contain a blue Ay. Thus R(2, 4) > 3. 

For the second point, suppose that the edges of Ky are 2-colored in some fash- 
ion. If any of the edges are red, then we have a red Ko. If none of the edges are 
red, then we have a blue Ky. So, no matter the coloring, we always get one of the 
two. This proves that R(2,4) = 4. 

What do you think is the value of R(2,5)? How about R(2,34)? As you will 
prove in Exercise 3, R(2,k) = k for all integers k > 2. 
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Exercises 


1. How many different 2-colorings are there of K3? of K4? of Ks? of Ky9? 
2. Write a nice proof of the fact that R(1, k) = 1 for all positive integers k. 
3. Write a nice proof of the fact that R(2,k) = k for integers k > 2. 

4. Prove that for positive integers p and q, R(p,q) = R(q,p). 


5. If2 <p’ < pand2 < q’ < q, then prove that R(p’, q’) < R(p,q). Also, 
prove that equality holds if and only if p’ = p and q' = gq. 


1.8.2 Exact Ramsey Numbers and Bounds 


Take me to your leader. 
— proverbial alien 


How many people are required at a gathering so that there must exist either three 
mutual acquaintances or three mutual strangers? We can rephrase this question 
as a problem in Ramsey theory: How many vertices do you need in an (edge) 2- 
colored complete graph for it to be necessary that there be either a red ‘3 (people 
who know each other) or a blue ‘3 (people who do not know each other)? As the 
next theorem states, the answer is 6. 


Theorem 1.61. R(3,3) = 6. 


Proof. We begin the proof by exhibiting (in Figure 1.122) a 2-coloring of the 
edges of Ks that produces neither a red (thick) /’3 nor a blue (thin) K3. This 


FIGURE 1.122. A 2-coloring of the edges of Ks. 


2-coloring of K; demonstrates that R(3,3) > 5. Now consider K¢, and suppose 
that each of its edges has been colored red or blue. Let v be one of the vertices of 
Kg. There are five edges incident with v, and they are each colored red or blue, 
so it must be that v is either incident with at least three red edges or at least three 
blue edges (think about this; it is called the Pigeonhole Principle—more on this 
in later chapters). Without loss of generality, let us assume that v is incident with 
at least three red edges, and let us call them vx, vy, and vz (see Figure 1.123). 
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FIGURE 1.123. 


FIGURE 1.124. 


Now, if none of the edges xy, xz, yz is colored red, then we have a blue K3 
(Figure 1.124). 

On the other hand, if at least one of xy, xz, yz is colored red, we have ared K3 
(Figure 1.125). 


FIGURE 1.125. 


Therefore, any 2-coloring of the edges of Ag produces either a red 3 or a blue 
K3. O 


Let us determine another Ramsey number. 


Theorem 1.62. R(3,4) = 9. 


Proof. Consider the 2-coloring of the edges of Kg given in Figure 1.126. 

A bit of examination reveals that this coloring produces no red (thick) A’3 and 
no blue (thin) Ay. Thus, R(3,4) > 9. We now want to prove that R(3,4) < 9, 
and we will use the facts that R(2,4) = 4 and R(3,3) = 6. 

Let G be any complete graph of order at least 9, and suppose that the edges of 
G have been 2-colored arbitrarily. Let v be some vertex of G. 

Case 1. Suppose that v is incident with at least four red edges. Call the end 
vertices of these edges “red neighbors” of v, and let S be the set of red neighbors 
of v (see Figure 1.127). 
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FIGURE 1.126. A 2-coloring of the edges in K’g. 


FIGURE 1.127. 


Since S contains at least four vertices, and since R(2,4) = 4, the 2-coloring 
of the edges that are within S must produce either a red Ke or a blue Ky within 
S itself. If the former is the case, then we are guaranteed a red K3 in G (see 
Figure 1.128). If the latter is the case, then we are clearly guaranteed a blue Ky 
in G. 


FIGURE 1.128. 


Case 2. Suppose that v is incident with at least six blue edges. Call the other 
end vertices of these edges “blue neighbors” of v, and let T’ be the set of blue 
neighbors of v (see Figure 1.129). 

Since T contains at least six vertices, and since R(3, 3) = 6, the 2-coloring of 
the edges that are within 7’ must produce either a red K3 or a blue K3 within T’ 
itself. If the former is the case, then we are obviously guaranteed a red [3 in G. 
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FIGURE 1.129. 


If the latter is the case, then we are guaranteed a blue K’, in G (see Figure 1.130). 


L} 


FIGURE 1.130. 


Case 3. Suppose that v is incident with fewer than four red edges and fewer than 
six blue edges. In this case there must be at most nine vertices in G altogether, 
and since we assumed at the beginning that the order of G is at least 9, we can 
say that G has order exactly 9. Further, we can say that v is incident with exactly 
three red edges and exactly five blue edges. And since the vertex v was chosen 
arbitrarily, we can assume that this holds true for every vertex of G. 

Now if we consider the underlying “red” subgraph of G, we have a graph with 
nine vertices, each of which has degree 3. But this cannot be, since the number 
of vertices in G with odd degree is even (the First Theorem of Graph Theory). 
Therefore, this case cannot occur. 

We have therefore proved that any 2-coloring of the edges of a complete 
graph on 9 vertices (or more) produces either a red K’3 or a blue /‘4. Hence, 
R(3,4) = 9. O 
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Some known Ramsey numbers are listed below. 


R(1,k) =1, 
R(2,k) =k, 
R(3,3) =6, R(3,4) =9, R(3,5) = 14, R(3,6) = 18, 
R(3,7) = 23, R(3,8) = 28, R(3, 9) = 36, 
R(4,4) = 18, R(4,5) = 25. 


Bounds on Ramsey Numbers 


Determining exact values of Ramsey numbers is extremely difficult in general. Jn 
fact, the list given above is not only a list of some known Ramsey numbers, it is 
a list of all known Ramsey numbers. Many people have attempted to determine 
other values, but to this day no other numbers are known. 

However, there has been progress in finding bounds, and we state some im- 
portant ones here. The proofs of the first two theorems will be discussed in Sec- 
tion 2.10.2 (see Theorem 2.28 and Corollary 2.29). The first bound is due to Erdés 
and Szekeres [94], two major players in the development of Ramsey theory. Their 
result involves a quotient of factorials: Here, n! denotes the product 1 -2---n. 


Theorem 1.63. For positive integers p and q, 


(p+q—2)! 
(p= Dig—1)t 


The next theorem gives a bound on R(p, q) based on “previous” Ramsey num- 
bers. 


R(p,q) < 


Theorem 1.64. If p > 2 and q > 2, then 


R(p,q) < R(p— 1,4) + R(p,q- 1). 


Furthermore, if both terms on the right of this inequality are even, then the in- 
equality is strict. 


The following bound is for the special case p = 3. 
Theorem 1.65. For every integer q > 3, 
g+3 
a 


R(3,q) < 


The final bound that we present is due to Erdés [90]. It applies to the special 
case p = q. In the theorem, |x| denotes the greatest integer less than or equal to 
a 


Theorem 1.66. [fp > 3, then 


R(p,p) > 2". 
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A number of other specific bounds are known: 


35 < R(4,6) < 41, 

43 < R(5,5) < 49, 

58 < R(5,6) < 87, 
102 < R(6,6) < 165. 


Even with the sophisticated computing power that is available to us today, we 
are not able to compute values for more than a handful of Ramsey numbers. Paul 
Erdés once made the following comment regarding the difficulty in finding exact 
values of Ramsey numbers [63]: 


Suppose an evil alien would tell mankind “Either you tell me [the 


value of R(5,5)] or I will exterminate the human race.” ... It would 
be best in this case to try to compute it, both by mathematics and with 
a computer. 


If he would ask [for the value of R(6, 6)], the best thing would be 
to destroy him before he destroys us, because we couldn’t [determine 
R(6,6)]. 

Exercises 


1. Prove that R(3,5) > 14. The graph in Figure 1.131 will be very helpful. 


= 
WX 


FIGURE 1.131. A 2-coloring of Ky3. 


2. Use Theorem 1.64 and the previous exercise to prove that R(3,5) = 14. 
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3. Construct a graph and a 2-coloring that proves R(4,4) > 18. 
4. Use Theorem 1.64 and the previous exercise to prove that R(4,4) = 18. 


5. Use Theorem 1.64 to prove Theorem 1.65. 


1.8.3 Graph Ramsey Theory 


All generalizations are dangerous, even this one. 
— Alexandre Dumas 


Graph Ramsey theory is a generalization of classical Ramsey theory. Its develop- 
ment was due in part to the search for the elusive classical Ramsey numbers, for it 
was thought that the more general topic might shed some light on the search. The 
generalization blossomed and became an exciting field in itself. In this section 
we explain the concept of graph Ramsey theory, and we examine several results. 
These results, and more like them, can be found in [136]. 

Given two graphs G and H, define the graph Ramsey number R(G, H) to be 
the smallest value of n such that any 2-coloring of the edges of /,, contains either 
ared copy of G' or a blue copy of H. The classical Ramsey number R(p, q) would 
in this context be written as R(IX,, Kq). 

The following simple result demonstrates the relationship between graph Ram- 
sey numbers and classical Ramsey numbers. 


Theorem 1.67. If G is a graph of order p and H is a graph of order q, then 
R(G, H) < R(p,q). 


Proof. Let n = R(p,¢q), and consider an arbitrary 2-coloring of K,,. By defini- 
tion, K,, contains either a red K, orablue K,.SinceG C K, and H C Kg, there 
must either be a red G ora blue H in K,,. Hence, R(G, H) <n=R(p,q). O 


Here is a result due to Chvatal and Harary [55] that relates R(G, H) to the 
chromatic number of G, y(G), and the order of the largest component of H, 
denoted by C( #7). 


Theorem 1.68. R(G, H) > (x(G) — 1)(C(A) — 1) +1. 


Proof. Letm = x(G) — 1 and let n = C(#H) — 1. Consider the graph S formed 
by taking m copies of K,, and adding all of the edges in between each copy 
(Figure 1.132). Actually, S = Kin. Now color all of the edges within each 
ix, blue, and color all other edges red. From the way we have constructed the 
coloring, every red subgraph can be vertex colored with m colors. Since m < 
x(G), there can be no red G present. Furthermore, any blue subgraph has at most 
n = C(f) — 1 vertices in its largest component. Hence, there can be no blue H 
present. We have exhibited a 2-coloring of K’,,,, that contains neither a red G nor 
a blue H. O 
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QA? 
[| | : 
FIGURE 1.132. The graph S. 


The next few theorems give exact graph Ramsey numbers for specific classes 
of graphs. The first is due to Chvatal [53], and the proof uses a few ideas from 
previous sections. 


Theorem 1.69. [f T;,, is a tree with m vertices, then 


Proof. If m = 1 orn = 1, then R(T,,, K,) = 1 and the result holds. Assume 
then that m and n are both at least 2. 


Claim A. R(Im, Kn) > (m—1)(n—1) +1. 


Consider the graph that consists of n — 1 copies of K,,—1, with all possible edges 
between the copies of K,,—1. This graph is actually K(m—1)(n—1). Color the 
edges in each K,,—; red, and color all of the other edges blue. Since each of 
the red subgraphs has order m — 1, no red T;,, can exist. Also, by this construc- 
tion, no blue K,, can exist. Since this 2-coloring contains no red T;,, and no blue 
Kn, it must be that R(Tm, Kn) > (m—1)(n—1) +1. 


Claim B. R(T, Kn) < (m—1)(n—1) +1. 


Let G be K(m_—1)(n—1)41, and suppose that its edges have been arbitrarily 2- 
colored. Let G, denote the subgraph of G formed by the red edges, and let G, 
denote the subgraph of G formed by the blue edges. If there is no blue K,,, then 
w(Gp) < n—1, and if so, then a(G,.) < n—1, since G, is the complement of G). 
Thus by Theorem 1.45, y(G;) > m. Let H be a subgraph of G’, that is m-critical. 
By part (d) of Exercise 6 in Section 1.6.1, 6(H) > m — 1. By Theorem 1.16, H 
contains T;,, as a subgraph, and therefore G has a red T,,,. Oo 


The next theorem is due to Burr [46]. 


Theorem 1.70. If T,,, is a tree of order m and if m — 1 divides n — 1, then 


R(Im, Kin) =mMm+n-—1. 
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In the following theorem, m/c2 denotes the graph consisting of m copies of 
Ko, and nk has a similar meaning. 


Theorem 1.71. [fm > n > 1, then 
R(mKo,nK2) =2m+n-—1. 


As we mentioned earlier, these results apply to specific classes of graphs. In 
general, determining values of R(G, ) is quite difficult. So the generalization 
that was intended to solve hard classical Ramsey problems has produced hard 
problems of its own! 


Exercises 
1. Find R(P3, Ps). 
2. Find R(P3, C1). 
3. Find R(C1, C1). 
4. Prove that R( i713, h1,3) = 6. 
5. Prove that R(2K3, K3) = 8. 


1.9 References 


Prince John: Are you finished? 
Sir Robin of Locksley: I’m only just beginning. 
— Robin Hood 


We have only just begun our walk through the field of graph theory. In this section 
we will provide references for those who are interested in further study. 

The books by Chartrand and Lesniak [52], Buckley and Lewinter [43] and West 
[281] provide very thorough introductions to a large number of topics in graph 
theory. The graduate-level texts by Diestel [75] and Bollobas [29], along with of- 
fering further study of the concepts covered in this chapter, also cover network 
flows, extremal graph theory, and random graphs. Gould’s book [128] covers a 
number of graph algorithms, from finding maximum matchings to testing pla- 
narity. Many interesting applications of graph theory can be found in texts by 
Gross and Yellen [139], Foulds [106], and Roberts and Tesman [238]. A good 
source for connections between graph theory and other mathematical topics is 
[21], edited by Beineke and Wilson. The text [148] by Harary is a thorough dis- 
cussion of counting various types of graphs. A wonderful source for the history of 
graph theory and some of its famous problems is the book [26] by Biggs, Lloyd, 
and Wilson. 

Buckley and Harary [42] have a nice text which focuses on distance in graphs. 
For more on the development of the Four Color Problem, see the books by Wilson 
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[285] and Aigner [3]. Much more information regarding Ramsey theory can be 
found in the book [136] by Graham, Rothschild, and Spencer. Also, Radziszowski 
[231] maintains a list of current bounds on small Ramsey numbers. 

The book by Barabasi [17] is a nice general treatment of graphs (networks) and 
their relationship to all sorts of phenomena. Finally, the books by Hoffman [163] 
and Schechter [247] tell the story of Paul Erdés, a twentieth century giant in the 
field of graph theory. 
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Combinatorics 


[Combinatorics] has emerged as a new subject standing at the 
crossroads between pure and applied mathematics, the center of 
bustling activity, a simmering pot of new problems and exciting 
speculations. 

— Gian-Carlo Rota, [243, p. vii] 


The formal study of combinatorics dates at least to Gottfried Wilhelm Leibniz’s 
Dissertatio de Arte Combinatoria in the seventeenth century. The last half-century, 
however, has seen a huge growth in the subject, fueled by problems and applica- 
tions from many fields of study. Applications of combinatorics arise, for example, 
in chemistry, in studying arrangements of atoms in molecules and crystals; biol- 
ogy, in questions about the structure of genes and proteins; physics, in problems 
in statistical mechanics; communications, in the design of codes for encryption, 
compression, and correction of errors; and especially computer science, for in- 
stance in problems of scheduling and allocating resources, and in analyzing the 
efficiency of algorithms. 

Combinatorics is, in essence, the study of arrangements: pairings and group- 
ings, rankings and orderings, selections and allocations. There are three principal 
branches in the subject. Enumerative combinatorics is the science of counting. 
Problems in this subject deal with determining the number of possible arrange- 
ments of a set of objects under some particular constraints. Existential combi- 
natorics studies problems concerning the existence of arrangements that possess 
some specified property. Constructive combinatorics is the design and study of 
algorithms for creating arrangements with special properties. 


J.M. Harris et al., Combinatorics and Graph Theory, DOI: 10.1007/978-0-387-797 1 1-3_2, 
© Springer Science+Business Media, LLC 2008 


130 2. Combinatorics 


Combinatorics is closely related to the theory of graphs. Many problems in 
graph theory concern arrangements of objects and so may be considered as com- 
binatorial problems. For example, the theory of matchings and Ramsey theory, 
both studied in the previous chapter, have the flavor of existential combinatorics, 
and we continue their study later in this chapter. Also, combinatorial techniques 
are often employed to address problems in graph theory. For example, in 
Section 2.5 we determine another method for finding the chromatic polynomial 
of a graph. 

We focus on topics in enumerative combinatorics through most of this chapter, 
but turn to some questions in existential combinatorics in Sections 2.4 and 2.10, 
and to some problems in constructive combinatorics in Sections 2.9 and 2.10. 
Throughout this chapter we study arrangements of finite sets. Chapter 3 deals 
with arrangements and combinatorial problems involving infinite sets. Our study 
in this chapter includes the investigation of the following questions. 


e Should a straight beat a flush in the game of poker? What about a full house? 


e Suppose a lazy professor collects a quiz from each student in a class, then 
shuffles the papers and redistributes them randomly to the class for grading. 
How likely is it that no one receives his or her own quiz to grade? 


e How many ways are there to make change for a dollar? 


e How many different necklaces with twenty beads can be made using rhodo- 
nite, rose quartz, and lapis lazuli beads, if a necklace can be worn in any 
orientation? 


e How many seating arrangements are possible for n guests attending a wed- 
ding reception in a banquet room with k round tables? 


e Suppose 100 medical students rank 100 positions for residencies at hospi- 
tals in order of preference, and the hospitals rank the students in order of 
preference. Is there a way to assign the students to the hospitals in such a 
way that no student and hospital prefer each other to their assignment? Is 
there an efficient algorithm for finding such a matching? 


e Is it possible to find a collection of n > 3 points in the plane, not all on the 
same line, so that every line that passes through two of the points in fact 
passes through a third? Or, if we require instead that no three points lie on 
the same line, can we arrange a large number of points so that no subset of 
them forms the vertices of a convex octagon? 


2.1 Some Essential Problems 
The mere formulation of a problem is far more essential than its 


solution. .. 
— Albert Einstein 
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We begin our study of combinatorics with two essential observations that underlie 
many counting strategies and techniques. The first is a simple observation about 
counting when presented with a number of alternative choices. 


Sum Rule. Suppose $1, S2,..., Sm are mutually disjoint finite sets, and |S;| = n; 
for 1 <%<™m. Then the number of ways to select one object from any of the sets 
Si, So,..., Sm is the sumn, + ng +-+++m. 


We often use the sum rule implicitly when solving a combinatorial problem 
when we break the set of possible outcomes into several disjoint cases, each of 
which can be analyzed separately. For example, suppose a coy college athlete 
tells us that his two-digit jersey number is divisible by 3, its first digit is odd, and 
its second digit is less than its first. How many numbers satisfy these criteria? A 
natural approach is to break the problem into five cases based on the first digit. 
Analyzing each of 1, 3, 5, 7, and 9 in turn, we find the possibilities are {}, {30}, 
{51,54}, {72,75}, or {90, 93, 96}, so there are eight possible jersey numbers in 
all. 

The second essential observation concerns counting problems where selections 
are made in sequence. 


Product Rule. Suppose $1, S2, ..., Sm are finite sets, and |S;| = n; for 1 < 
i <m. Then the number of ways to select one element from 5}, followed by one 
element from Sz, and so on, ending with one element from Sm, is the product 
N4N2Q°++Nm, provided that the selections are independent, that is, the elements 
chosen from 5S}, ..., S;—1 have no bearing on the selection from Sj, for each i. 


For example, consider the number of m-letter acronyms that can be formed 
using the full alphabet. To construct such an acronym, we make m choices in 
sequence, one for each position, and each choice has no effect on any subsequent 
selection. Thus, by the product rule, the number of such acronyms is 26”. 

We can apply a similar strategy to count the number of valid phone numbers 
in the U.S. and Canada. Under the North American Numbering Plan, a phone 
number has ten digits, consisting of an area code, then an exchange, then a station 
code. The three-digit area code cannot begin with 0 or 1, and its second digit can 
be any number except 9. The three-digit exchange cannot begin with 0 or 1, and 
the station code can be any four-digit number. Using the product rule, we find that 
the number of valid phone numbers under this plan is (8-9 - 10) - (8-10?) -104 = 
5 760 000 000. 

One might object that certain three-digit numbers are service codes reserved 
for special use in many areas, like 411 for information and 911 for emergencies. 
Let’s compute the number of valid phone numbers for which neither the area code 
nor the exchange end with the digits 11. The amended number of area codes is 
then 8(9 - 10 — 1) = 712, and for exchanges we obtain 8 - 99 = 792. Thus, the 
number of valid phone numbers is 712 - 792 - 10* = 5 639 040 000. 

We can use the product rule to solve three basic combinatorial problems. 
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Problem 1. How many ways are there to order a collection of n different objects? 


For example, how many ways are there to arrange the cards in a standard deck 
of 52 playing cards by shuffling? How many different batting orders are possible 
among the nine players on a baseball team? How many ways are there to arrange 
ten books on a shelf? 

To order a collection of n objects, we need to pick one object to be first, then 
another one to be second, and another one third, and so on. There are n different 
choices for the first object, then n — 1 remaining choices for the second, and n — 2 
for the third, and so forth, until just one choice remains for the last object. The 
total number of ways to order the n objects is therefore the product of the integers 
between 1 and n. This number, called n factorial, is written n!. An ordering, or 
rearrangement, of n objects is often called a permutation of the objects. Thus, the 
number of permutations of 7 items is n!. 

Our second problem generalizes the first one. 


Problem 2. How many ways are there to make an ordered list of / objects from a 
collection of n different objects? 


For example, how many ways can a poll rank the top 20 teams in a college sport 
if there are 100 teams in the division? How many ways can a band arrange a play 
list of twelve songs if they know only 25 different songs? 

Applying the same reasoning used in the first problem, we find that the answer 
to Problem 2 is the product n(n — 1)(n — 2)-+-(n—k +1), or n!/(n— k)!. This 
number is sometimes denoted by P(n, k), but products like this occur frequently 
in combinatorics, and a more descriptive notation is often used to designate them. 

We define the falling factorial power x* as a product of k terms beginning with 
x, with each successive term one less than its predecessor: 


k-1 
ok = x(a —1)(a—2)---(n-k+1) = |] @—-?. (2.1) 
i=0 


The expression x is pronounced “x to the k falling.” Similarly, we define the 
rising factorial power x* (“x to the k rising”) by 


k-1 
x® = 2(e + 1)(e+2)---(e+k-1) = ][ +9). (2.2) 
i=0 


Thus, we see that P(n,k) = nk = (n—k+1)*, and n! = n® = 1". Also, the 
expressions n2, n°, and 0! all represent products having no terms at all. Multiply- 
ing any expression by such an empty product should not disturb the value of the 
expression, so the value of each of these degenerate products is taken to be 1. 


Our third problem concerns unordered selections. 
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Problem 3. How many ways are there to select k objects from a collection of n 
objects, if the order of selection is irrelevant? 


For example, how many different hands are possible in the game of poker? A 
poker hand consists of five cards drawn from a standard deck of 52 different cards. 
The order of the cards in a hand is unimportant, since players can rearrange their 
cards freely. 

The solution to Problem 3 is usually denoted by Cs or sometimes C'(n, k). 
The expression C) is pronounced “n choose k.” 

We can find a formula for (7) by using our solutions to Problems | and 2. 
Since there are k! different ways to order a collection of k objects, it follows that 


the product ( TDR! is the number of possible ordered lists of k objects selected 
from the same collection of n objects. Therefore, 


n n& n! 
& 7 Bint ee 


The numbers ( fe are called binomial coefficients, for reasons discussed in the 
next section. The binomial coefficients are ubiquitous in combinatorics, and we 
close this section with a few applications of these numbers. 


1. The number of different hands in poker is (*?) = 5223/5! = 2598 960. The 
number of different thirteen-card hands in the game of bridge is (73) = 
635 013 559 600. 


2. To play the Texas lottery game Lotto Texas, a gambler selects six differ- 
ent numbers between | and 54. The order of selection is unimportant. The 
number of possible lottery tickets is therefore (=) = 25 827 165. 


3. Suppose we need to travel m blocks east and n blocks south in a regular 
grid of city streets. How many paths are there to our destination if we travel 
only east and south? 


We can represent a path to our destination as a sequence by, bo, ..., bn+m; 
where b; represents the direction we are traveling during the 7th block of 
our route. Exactly m of the terms in this sequence must be “east,” and there 
are precisely Co) ways to select m positions in the sequence to have this 
value. The remaining n positions in the sequence must all be “south,” so 


the number of possible paths is (""'") = (tn) 2 


4. A standard deck of playing cards consists of four suits (spades, hearts, 
clubs, and diamonds), each with thirteen cards. Each of the cards in a suit 
has a different face value: a number between 2 and 10, or a jack, queen, 
king, or ace. How many poker hands have exactly three cards with the same 
face value? 


We can answer this question by considering how to construct such a hand 
through a sequence of simple steps. First, select one of the thirteen different 
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face values. Second, choose three of the four cards in the deck having this 
value. Third, pick two cards from the 48 cards having a different face value. 
By the product rule, the number of possibilities is 


(1) (3) (a) = 80 


Poker aficionados will recognize that this strategy counts the number of 
ways to deal either of two different hands in the game: the “three of a kind” 
and the stronger “full house.” A full house consists of a matched triple 
together with a matched pair, for example, three jacks and two aces; a three 
of a kind has only a matched triple. The number of ways to deal a full house 


(1) @)G)G) = 


since choosing a matched pair involves first selecting one of twelve differ- 
ent remaining face values, then picking two of the four cards having this 
value. The number of three of a kind hands is therefore 58 656 — 3744 = 
54912. 


We can also compute this number directly by modifying our first strategy. 
To avoid the possibility of selecting a matched pair in the last step, we can 
replace the term (*) = 48 - 47/2 by 48 - 44/2, since the face value of 
the last card should not match any other card selected. Indeed, we calculate 
13-4-48- 44/2 = 54912. Notice that dividing by 2 is required in the last 
step, since the last two cards may be selected in any order. 


Exercises 


1. In the C++ programming language, a variable name must start with a letter 
or the underscore character (_), and succeeding characters must be letters, 
digits, or the underscore character. Uppercase and lowercase letters are con- 
sidered to be different characters. 


(a) How many variable names with exactly five characters can be formed 
in C++? 
(b) How many are there with at most five characters? 


(c) How many are there with at most five characters, if they must read 
exactly the same forwards and backwards? For example, kayak and 
T55T are admissible, but Kayak is not. 


2. Assume that a vowel is one of the five letters A, E, I, O, or U. 


(a) How many eleven-letter sequences from the alphabet contain exactly 
three vowels? 


(b) How many of these have at least one repeated letter? 
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3. There are 30 teams in the National Basketball Association: 15 in the West- 
ern Conference, and 15 in the Eastern Conference. 


(a) Suppose each of the teams in the league has one pick in the first round 
of the NBA draft. How many ways are there to arrange the order of 
the teams selecting in the draft? 


(b) Suppose that each of the first three positions in the draft must be 
awarded to one of the fourteen teams that did not advance to the play- 
offs that year. How many ways are there to assign the first three posi- 
tions in the draft? 


(c) How many ways are there for eight teams from each conference to 
advance to the playoffs, if order is unimportant? 


(d) Suppose that every team has three centers, four guards, and five for- 
wards. How many ways are there to select an all-star team with the 
same composition from the Western Conference? 


4. According to the Laws of the Game of the International Football Associa- 
tion, a full football (soccer) team consists of eleven players, one of whom 
is the goalkeeper. The other ten players fall into one of three outfield posi- 
tions: defender, midfielder, and striker. There is no restriction on the number 
of players at each of these positions, as long as the total number of outfield 
players is ten. 


(a) How many different configurations are there for a full football team? 
For example, one team may field four strikers, three midfielders, and 
three defenders, in addition to the goalkeeper. Another may play five 
strikers, no midfielders, and five defenders, plus the goalkeeper. 


(b) Repeat the previous problem if there must be at least two players at 
each outfield position. 


(c) How many ways can a coach assign eleven different players to one of 
the four positions, if there must be exactly one goalkeeper, but there 
is no restriction on the number of players at each outfield position? 


5. A political science quiz has two parts. In the first, you must present your 
opinion of the four most influential secretaries-general in the history of the 
United Nations in a ranked list. In the second, you must name ten members 
of the United Nations security council in any order, including at least two 
permanent members of the council. If there have been eight secretaries- 
general in U.N. history, and there are fifteen members of the U.N. security 
council, including the five permanent members, how many ways can you 
answer the quiz, assuming you answer both parts completely? 


6. A midterm exam in phenomenology has two parts. The first part consists of 
ten multiple choice questions. Each question has four choices, labeled (a), 
(b), (c), and (d), and one may pick any combination of responses on each 
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of these questions. For example, one could choose just (a) alone on one 
question, or both (b) and (c), or all four possibilities, or none of them. In 
the second part, one may choose either to answer eight true/false questions, 
or to select the proper definition of each of seven terms from a list of ten 
possible definitions. Every question must be answered on whichever part is 
chosen, but one is not allowed to complete both portions. How many ways 
are there to complete the exam? 


. A ballot lists ten candidates for city council, eight candidates for the school 


board, and five bond issues. The ballot instructs voters to choose up to four 
people running for city council, rank up to three candidates for the school 
board, and approve or reject each bond issue. How many different ballots 
may be cast, if partially completed (or empty) ballots are allowed? 


. Compute the number of ways to deal each of the following five-card hands 


in poker. 


(a) Straight: the values of the cards form a sequence of consecutive inte- 
gers. A jack has value 11, a queen 12, and a king 13. An ace may have 
a value of 1 or 14, so A2345 and 10J QKA are both straights, but 
KA234 is not. Furthermore, the cards in a straight cannot all be of 
the same suit (a flush). 


(b) Flush: All five cards have the same suit (but not in addition a straight). 


(c) Straight flush: both a straight and a flush. Make sure that your counts 
for straights and flushes do not include the straight flushes. 


(d) Four of a kind. 

(e) Two distinct matching pairs (but not a full house). 

(f) Exactly one matching pair (but no three of a kind). 

(g) At least one card from each suit. 

(h) At least one card from each suit, but no two values matching. 


(i) Three cards of one suit, and the other two of another suit, like three 
hearts and two spades. 


9. In the lottery game Texas Two Step, a player selects four different numbers 


between 1 and 35 in step 1, then selects an additional “bonus ball” number 
in the same range in step 2. The latter number is not considered to be part 
of the set selected in step 1, and in fact it may match one of the numbers 
selected there. 


(a) A resident of College Station always selects a bonus ball number that 
is different from any of the numbers he picks in step 1. How many of 
the possible Texas Two Step tickets have this property? 


10. 


11. 


12. 


13. 


14. 


2.2 
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(b) In Rhode Island’s lottery game Wild Money, a gambler picks a set 
of five numbers between 1 and 35. Is the number of possible tickets 
in this game the same as the number of tickets in Texas Two Step 
where the bonus ball number is different from the other numbers? 
Determine the ratio of the number of possible tickets in Wild Money 
to the number in the restricted Texas Two Step. 


CX 
tae} 
wm 


A superstitious resident of Amarillo always picks three even numbers 
and three odd numbers when playing Lotto Texas. What fraction of 
all possible lottery tickets have this property? 


(b 


we 


Suppose in a more general lottery game one selects six numbers be- 
tween | and 2n. What fraction of all lottery tickets have the property 
that half the numbers are odd and half are even? 


(c) What is the limiting value of this probability as n grows large? 
Suppose a positive integer N factors as N = pi'p5?---p?™, where pi, 


P2, +++; Pm are distinct prime numbers and 71, ne, ..., Mm are all positive 
integers. How many different positive integers are divisors of N’? 


Assume that a positive integer cannot have 0 as its leading digit. 


(a) How many five-digit positive integers have no repeated digits at all? 

(b) How many have no consecutive repeated digits? 

(c) How many have at least one run of consecutive repeated digits (for 
example, 23324, 45551, or 151155, but not 12121)? 


How many positive integers are there whose representation in base 8 has 
exactly eight octal digits, at most one of which is odd? An octal digit is a 
number between 0 and 7, inclusive. Assume that the octal representation of 
a positive integer cannot start with a zero. 


Let A be the difference operator: A(f(x)) = f(a +1) — f(x). Show that 
A(x) = net, 
and use this to prove that 


m—1 n+1 


n _ nh 
= n+1° 


Binomial Coefficients 


About binomial theorem I’m teeming with a lot o’ news, 
With many cheerful facts about the square of the hypotenuse. 
— Gilbert and Sullivan, The Pirates of Penzance 
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The binomial coefficients possess a number of interesting arithmetic properties. 
In this section we study some of the most important identities associated with 
these numbers. Because binomial coefficients occur so frequently in this subject, 
knowing these essential identities will be helpful in our later studies. 

The first identity generalizes our formula (2.3). 


Expansion. /f n is a nonnegative integer and k; is an integer, then 


n! . : 
(;) Jno = s™ (2.4) 


0 otherwise. 


Designating the value of (7) to be 0 when &; < O ork > mis sensible, for there 
are no ways to select fewer than zero or more than n items from a collection of n 
objects. 

Notice that every subset of & objects selected from a set of n objects leaves 
a complementary collection of n — k objects that are not selected. Counting the 
number of subsets with k objects is therefore the same as counting the number of 
subsets with n — k objects. This observation leads us to our second identity, which 
is easy to verify using the expansion formula. 


Symmetry. [fn is a nonnegative integer and k is an integer, then 


(i) (2) °° 


Before presenting the next identity, let us consider again the problem of count- 
ing poker hands. Suppose the ace of spades is the most desirable card in the deck 
(it certainly is in American Western movies), and we would like to know the num- 
ber of five-card hands that include this card. The answer is the number of ways 
to select four cards from the other 51 cards in the deck, namely, ene We can 
also count the number of hands that do not include the ace of spades. This is the 
number of ways to pick five cards from the other 51, that is, ee But every poker 
hand either includes the ace of spades or does not, so 


52\ 51 i dl 
5/ \5 4} 
More generally, suppose we distinguish one particular object in a collection of 
n objects. The number of unordered collections of k of the objects that include 
the distinguished object is (aes the number of collections that do not include 


k-1 
this special object is (eae We therefore obtain the following identity. 


Addition. [fn is a positive integer and k is any integer, then 


(= Ce) + Gra) 2.6) 
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We can prove this identity more formally using the expansion identity. It is easy 
to check that the identity holds fork < 0 ork > n. If0 < k < n, we have 


n—1 n—1 (n—1)! (n —1)! 
e +o) - fa Se 
= ((n—k)+k)(n—1)! 
k(n — k)! 
n! 


~ ki(n—k)! 
n 
= (i): 

We can use this identity to create a table of binomial coefficients. Let n > 0 
index the rows of the table, and let k > O index the columns. Begin by enter- 
ing 1 in the first position of each row, since ) = 1 forn > 0; then use (2.6) 
to compute the entries in successive rows of the table. The resulting pattern of 
numbers is called Pascal’s triangle, after Blaise Pascal, who studied many of its 
properties in his Traité du Triangle Arithmétique, written in 1654. (See [85] for 
more information on its history.) The first few rows of Pascal’s triangle are shown 
in Figure 2.1. 


"lk=0 12 3 4 5 6 7 8 9 10| 2 
n=0 1 I 
1 t a 2 
2 121 4 
3 i223 2 8 
4 146 4 1 16 
5 1510 10 5 1 32 
6 1 615 20 15 6 1 64 
? 1721 35 3 21 #7 «#1 128 
8 1 8 2 56 70 56 28 8 1 256 
9 1 9 36 84 126 126 84 36 9 1 512 
10 1 10 45 120 210 252 210 120 45 10 1| 1024 


TABLE 2.1. Pascal’s triangle for binomial coefficients, eae 


The next identity explains the origin of the name for these numbers: They ap- 
pear as coefficients when expanding powers of the binomial expression x + y. 


The Binomial Theorem. /fn is a nonnegative integer, then 


(e+y)"=>° ee. (2.7) 


k 
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The notation >. means that the sum extends over all integers k. Thus, the right 


k 
side of (2.7) is formally an infinite sum, but all terms with k < 0 ork > n are 
zero by the expansion identity, so there are only n + 1 nonzero terms in this sum. 


Proof. We prove this identity by induction on n. For n = 0, both sides evalu- 
ate to 1. Suppose then that the identity holds for a fixed nonnegative integer n. 
We need to verify that it holds for n + 1. Using our inductive hypothesis, then 
distributing the remaining factor of (x + y), we obtain 


(ety =@+yD & ahyn-h 


k 
_ NY k+1,n—-k N\ ko n+1—k 
=D (i) y +S (yet : 


Now we reindex the first sum, replacing each occurrence of k by k: — 1. Since the 
original sum extends over all values of k, the reindexed sum does, too. Thus 


n n ¢, M+1—k nr es nti— 
(x+y) 4a ty +1 "+> (flats +1-—k 
k k 


“E(G2)+G)) are 


by the addition identity. This completes the induction, and we conclude that the 
identity holds for all n > 0. O 


We note two important consequences of the binomial theorem. First, setting 
x = y = 1 in (2.7), we obtain 


S- (;) =", (2.8) 


k 


Thus, summing across the nth row in Pascal’s triangle yields 2”, and there are 
therefore exactly 2” different subsets of a set of n elements. These row sums are 
included in Table 2.1. 

Second, setting zc = —1 and y = 1 in (2.7), we find that the alternating sum 
across any row of Pascal’s triangle is zero, except of course for the top row: 


yn Oe . 0 ifn >1, 
»| ) ()-{i ifn =0. ee) 


This is obvious from the symmetry relation when n is odd, but less clear when n 
is even. 
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These two consequences of the binomial theorem concern sums over the lower 
index of binomial coefficients. The next identity tells us the value of a sum over 
the upper index. 


Summing on the Upper Index. [fm and n are nonnegative integers, then 


ie k n+l 
S- (*) = ce a (2.10) 


k=0 


Proof. We use induction on 7 to verify this identity. Form = 0, each side equals 1 
if m = 0, and each side is 0 if m > 0. Suppose then that the identity holds for 
some fixed nonnegative integer n. We must show that it holds for the case n + 1. 
Let m be a nonnegative integer. We obtain 


(A= (2)+E() 
=F) +4) 
- (747). 


By induction, the identity holds for all n > 0. O 


To illustrate one last identity, we study the Lotto Texas game in more detail. 
Recall that a player selects six different numbers between | and 54 to enter the 
lottery. The largest prize is awarded to anyone matching all six numbers picked 
in a random drawing by lottery officials, but smaller prizes are given to players 
matching at least three of these numbers. To determine fair amounts for these 
smaller prizes, the state lottery commission needs to know the number of possible 
tickets that match exactly k of the winning numbers, for every k. 

Clearly, there is just one way to match all six winning numbers. There are 
(2) = 6 ways to pick five of the six winning numbers, and 48 ways to select one 
losing number, so there are 6-48 = 288 tickets that match five numbers. Selecting 
four of the winning numbers and two of the losing numbers makes ($) (*°) 
16 920 possible tickets, and in general we see that the number of tickets that match 
exactly k of the winning numbers is ies) aan By summing over k, we count 
every possible ticket exactly once, so 


(SB) -D(ena): 


More generally, if a lottery game requires selecting m numbers from a set of m+n 
numbers, we obtain the identity 


On) EG) (n2 a) 
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That is, the number of possible tickets equals the sum over & of the number of 
ways to match exactly / of the m winning numbers and m — k of the n losing 
numbers. More generally still, suppose a lottery game requires a player to select 
£ numbers on a ticket, and each drawing selects m winning numbers. Using the 
same reasoning, we find that 


Or) EC)y) 


Now replace ¢ by ¢ + p and reindex the sum, replacing k by k + p, to obtain the 
following identity. 


Vandermonde’s Convolution. /f 1m and n are nonnegative integers and ¢ and p 


are integers, then 
men m n 
= : 2.11 
we ya yey ia 


Notice that the lower indices in the binomial coefficients on the right side sum 
to a constant. 


Exercises 


1. Use a combinatorial argument to prove that there are exactly 2” different 
subsets of a set of n elements. (Do not use the binomial theorem.) 


2. Prove the absorption/extraction identity: If n is a positive integer and k is a 


nonzero integer, then 
n nf(n-1 
== : 2.12, 
(1) Ele) em 


3. Use algebraic methods to prove the cancellation identity: If m and k are 
nonnegative integers and m is an integer with m < n, then 


(;) i 7 te 6 7 a (2.13) 


This identity is very useful when the left side appears in a sum over k, since 
the right side has only a single occurrence of k. 


4. Suppose that a museum curator with a collection of n paintings by Jackson 
Pollack needs to select & of them for display, and needs to pick m of these 
to put in a particularly prominent part of the display. Show how to count 
the number of possible combinations in two ways so that the cancellation 
identity appears. 
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5. Prove the parallel summation identity: If m and n are nonnegative integers, 
then 
- k 1 
k n 
k=0 


6. Prove the hexagon identity: If n is a positive integer and k is an integer, 
then 


co ae ea ~ (".) ee Cae (2.15) 


Why is it called the hexagon identity? 


7. Compute the value of the following sums. Your answer should be an ex- 
pression involving one or two binomial coefficients. 


80\ /k +1 
oD G)Ca) 
1 /99\ / 200 
© Sails) (i202): 
iss k )\120—k 
201 k 201 j 
Cm» Saree 


k=100 7=100 


2 
(d) > (1) , for a nonnegative integer n. 
k 


(e) S- (=1)" (;) , for an integer m and a nonnegative integer n. 
k<m 


8. Prove the binomial theorem for falling factorial powers, 
(et y= 5S >(" \abynee 
k : 
k 
and for rising factorial powers, 


(at+y)"=>- (1) chy, 


k 


9. Let n be a nonnegative integer. Suppose f(a) and g(a) are functions de- 
fined for all real numbers x, and that both functions are n times differen- 
tiable. Let f‘*) (x) denote the kth derivative of f(a), so f(x) = f(x), 
f(x) = f'(a), and f(x) = f(x). Let h(x) = f(x)g(x). Show that 


Kh (2) => (7) Fedo") 


k 
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10. In the Virginia lottery game Win For Life, an entry consists of a selection 
of six different numbers between 1| and 42, and each drawing selects seven 
different numbers in this range. How many different entries can match at 
least three of the drawn numbers? 


11. The state of Florida administers several lottery games. In Florida Lotto, a 
player picks a set of six numbers between | and 53. In Fantasy 5, a gambler 
chooses a set of five numbers between 1 and 36. In which game is a player 
more likely to match at least two numbers against the ones drawn? 


2.3. Multinomial Coefficients 


Alba, alma, ball, balm, bama, blam, lama, lamb, ma’am, mall, 
malm, mama, .. . 
— Words constructed from letters in “Alma, Alabama” 


Suppose we want to know the number of ways to place n different objects into 
two boxes, one marked A and the other marked B, in such a way that box A 
receives a specified number a of the objects, and box B gets the remaining 6b 
objects, so a + b = n. Assume that the order of placement of the objects in each 
box is immaterial, and denote the total number of such arrangements by (. We 
can compute this number easily by using our knowledge of binomial coefficients. 
Since each valid distribution corresponds to a different subset of a objects for 
box A, we see that es is simply the binomial coefficient (”) (or ('})). Thus, 
= an 

Now imagine we have three boxes, labeled A, B, and C,, and suppose we want 
to know the number of ways to place a prescribed number a of the objects in 
box A, a given number 6 in box B, and the remaining c = n — a — b in box C. 
Again, assume the order of placement of objects in each box is irrelevant, and 
denote this number by (ae Since each arrangement can be described by first 
selecting a elements from the set of n for box A, and then picking b objects from 
the remaining n — a for box B, we see by the product rule that 


Ga leey, 


n! (n —a)! 


n 
b 


= (2.16) 
al(n—a)! b'(n—a-—by)! 
n! 
~ loll’ 
The number ie) is called a trinomial coefficient. 
We can generalize this problem for an arbitrary number of boxes. Suppose we 
have n objects, together with m boxes labeled 1, 2, ..., m, and suppose ky, ko, 


.., km are nonnegative integers satisfying k; + k2+---+k», =n. We define the 
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multinomial coefficient ( ies oe ) to be the number of ways to place k, of the 
objects in box 1, k2 in box 2, and so on, without regard to the order of the objects 
in each box. Then an argument similar to our analysis for trinomial coefficients 


shows that 
n _{n n—ky\ (n—k, — ke 
ky1,...,km/  \ka ko kg 


. (" Ste ies) (2.17) 
Kim—1 


n! 


a ee 


Multinomial coefficients often arise in a natural way in combinatorial problems. 
While we can always reduce questions about multinomial coefficients to prob- 
lems about binomial coefficients or factorials by using (2.17), it is often useful to 
handle them directly. We derive some important formulas for multinomial coef- 
ficients in this section. These generalize some of the statements about binomial 
coefficients from Section 2.2. We begin with a more general formula for expand- 
ing multinomial coefficients in terms of factorials. 


Expansion. /f n is a nonnegative integer, and ky, ..., km are integers satisfying 
ky +---+kp = 7, then 


if each k; => 0, 


(, n ; )- ae ae (2.18) 
Lre+ sm 0 otherwise. 


Taking ae i, = 0 when at least one of the k; is negative is certainly sensible, 
since it is impossible to place a negative number of objects in a box. 

Second, it is clear that rearranging the numbers ky, ..., k, does not affect 
the value of the multinomial coefficient (a ells since this just corresponds to 
relabeling the boxes. We can state this in the following way. 


Symmetry. Suppose 7(1), ..., m(m) is a permutation of {1,...,m}. Then 


nr nr 
= 2.19 
oan fie ea) ae, 


Third, we can observe a simple addition law. Let a be one of the objects from 
the set of n. It must be placed in one of the boxes. If we place a in box 1, then 


n—1 +a: : 
there are ae ways to arrange the remaining n — 1 objects to create 
n—-1 


a valid arrangement. If we set a in box 2, then there are (gel, bail ) to 
complete the assignment of objects to boxes. Continuing in this way, we obtain 
the following identity. 
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Addition. [fn is a positive integer and ky + +--+ km =n, then 


n _ n—-1 de n—-1 
ky,.--) km 7 ky —1,ko,..., km ky, ko —1,k3,...,km 


(2.20) 
ky, ka, pa ->km—1, km —1 


In the last section, the addition identity for m = 2 produced Pascal’s triangle 
for the binomial coefficients. We can use a similar strategy to generate a geomet- 
ric arrangement of the trinomial coefficients when m = 3, which we might call 
Pascal’s pyramid. The top level of the pyramid corresponds to n = 0, just as in 
Pascal’s triangle, and here we place a single 1, for (osaa) The next level holds 
the numbers for n = 1, and we place the three Ls in a triangular formation, just 
below the n = 0 datum at the apex, for the numbers (aoe) (aia) and Gea) 
In general, we use the addition formula (2.20) to compute the numbers in level n 


from those in level n — 1, and we place the value of Caw in level n just below 


‘ -1 -1 -1); 

the triangular arrangement of numbers lo Gees (erate and (ee) in level 
n—1. Figure 2.1 shows the first few levels of the pyramid of trinomial coefficients. 
Here, the position of each number in level n is shown relative to the positions of 


the numbers in level n — 1, each of which is marked with a triangle (A). 


FIGURE 2.1. The first five levels of Pascal’s pyramid. 
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We can use the addition identity to obtain an important generalization of the 
binomial theorem for multinomial coefficients. 


The Multinomial Theorem. [fn is a nonnegative integer, then 


nm ° 
(tit---+2m)"= DO (: k \eh ener (2.21) 
kitertkman Sb m 


Here, the notation )~* kit--tkm=n Means that the sum extends over all integer 
m-tuples (k1,..., km) whose sum is n. Of course, there are infinitely many such 
m-tuples, but only finitely many produce a nonzero term by the Expansion iden- 
tity, so this is in effect a finite sum. We prove (2.21) for the case m = 3; the 
general case is left as an exercise. 


Proof. The formula 
n b 
= Oa? 2° 2.22 
(2+y+2) oe. (, b, Je yz (2.22) 


certainly holds for n = 0, so suppose that it is valid for n. We compute 


a,b,c 
a+b+c=n 
_ n at+l1,bie n a, b+1 ¢ 
(eee > ee 
a+b+c=n atb+c=n 
n a, bictl1 
a s (cae) yz 
a+b+c=n 
nr nr nr 
— + + 1a bye 
a ee ak ae (a) 
a+b+c=n+1 


(i) a, bre 
= 5 uy se, 
: a,b,c 


a+b+c=n+ 
so (2.22) holds for all n > 0. O 


Some additional formulas for multinomial coefficients are developed in the ex- 
ercises. Some of these may be obtained by selecting particular values for x1, ..., 
Lm in the multinomial theorem. 

We close this section by describing a common way that multinomial coeffi- 
cients appear in combinatorial problems. Suppose we need to count the number 
of ways to order a collection of n objects. If all the objects are different, then the 
answer is simply n!, but what if our collection includes some duplicate objects? 
Such a collection is called a multiset. Certainly we expect fewer different arrange- 
ments when there are some duplicate objects. For example, there are just six ways 
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to line up four poker chips, two of which are red and the other two blue: rrbb, 
rbrb, rbbr, brrb, brbr, and bbrr. 

Suppose we have a multiset of size n that includes exactly k, identical copies 
of one object, kz instances of another, and so on, ending with k,, duplicates of 
the last object, so ky +---+ ky», = n. In any ordering of these n objects, we may 
rearrange the k; copies of object 2 in any way without disturbing the arrangement. 
Since we can do this for any of the m objects independently, it follows that each 
distinct ordering of the items occurs k,!k2!---k,,! times among the n! ways that 
one could arrange the objects if they had been distinguishable. Therefore, the 
number of distinct arrangements of our multiset is 


n! - n 
kyls++Kkm! \ki,..., Bm J’ 


We could also obtain this formula by using our first combinatorial model for 
the multinomial coefficients. Suppose we have n ping-pong balls, numbered 1 
through n, and m boxes, each labeled with a different object from our multiset. 
The number of ways to distribute the balls among the boxes, with k, in box 1, kz in 
box 2, and so on, is the multinomial coefficient te iS hin ). But each arrangement 
corresponds to an ordering of the elements of our muitiset: The numbers in box i 
indicate the positions of object 2 in the listing. 

We have thus answered the analogue of Problem | from Section 2.1 for multi- 
sets. We can also study a generalization of Problem 2: How many ways are there 
to make an ordered list of r objects from a multiset of n objects, if the multiset 
comprises k;; copies of object 7 for 1 < 7 < m? Our approach to this problem 
depends on the k; and r, so we’ll study an example. Suppose a contemplative 
resident of Alma, Alabama, wants to know the number of ways to rearrange the 
letters of her home town and state, ignoring differences in case. There are eleven 
letters in all: six As, one B, two Ls, and two Ms, so she computes the total 
number to be (enon) — ao = 13860. 

Suppose she also wants to know the number of four-letter sequences of letters 
that can be formed from the same string, ALMAALABAMA, like the ones in the 
list that open this section, only they do not have to be English words. This is 
the multiset version of Problem 2 with n = 11, r = 4, m = 4, ky = 6, ko = 1, 
and k3 = k4 = 2. We can solve this by constructing each sequence in two steps: 
first, select four elements from the multiset; second, count the number of ways 
to order that subcollection. We can group the possible sub-multisets according to 
their pattern of repeated elements. For example, consider the subcollections that 
have two copies of one object, and two copies of another. Denote this pattern by 
wwa«z. There are (3) = 3 ways to select values for w and x, since we must pick 
two of the three letters A, L, and M. Each of these subcollections can be ordered 
in any of (ea) = 6 ways, so the pattern wwax produces 3-6 = 18 possible 
four-letter sequences in all. There are five possible patterns for a four-element 
multiset, which we can denote wwww, wwwx, wwre, wwaxy, and wxryz. The 
analysis of each one is summarized in the following table. 


2.3 Multinomial Coefficients 149 


Pattern Sub-multisets Orderings per sub-multiset Total 


wwww 1 1 1 

UO (3) (a) 12 
WWLE (3) (a) 18 
wwey (7) (3) (oi) 108 
WLYZ ih 4! 24 


Summing the values in the rightmost column, we find that there are exactly 163 
ways to form a four-letter sequence from the letters in Alma, Alabama. 


Exercises 


1. 


Prove the addition identity for multinomial coefficients (2.20) by using the 
expansion identity (2.18). 


. For nonnegative integers a, b, and c, let P(a,b,c) denote the number of 


paths in three-dimensional space that begin at the origin, end at (a, b,c), 
and consist entirely of steps of unit length, each of which is parallel to a 


coordinate axis. Prove that P(a, b,c) = eres) 


. Prove the multinomial theorem (2.21) for an arbitrary positive integer m. 


. Prove the following identities for sums of multinomial coefficients, if m 


and n are positive integers. 


(a) 2 Co ) =m 
k ? b] m 


ki te-thm 


n 0 ifm=20 
b ii kotkat-e-+koe _ , 
oe, Cs hn)! ) 1 ifm=2@+1. 


. Prove that if n is a nonnegative integer and k; is an integer, then 


d a -2(). 


j 


. Prove the multinomial theorem for falling factorial powers, 


ny n ki km 
(iit: tamz= 5 ‘4 er tin 
ee eee 1y)+++9&m 


and for rising factorial powers, 
w S- n ka Kem 
(ny +++ +a,)" = € k oP ake. 
hase Lys seg Meme, 


You may find it helpful to consider the trinomial case first. 
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10. 


11. 


2.4 
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. Use a combinatorial argument to establish the following analogue of Van- 


dermonde’s convolution for trinomial coefficients. If m and n are nonneg- 
ative integers, anda +b+c=m-+n, then 


m n m+n 
2 Ce Geren — ees 


at+B+y=m 


. State an analogue of Vandermonde’s convolution for multinomial coeffi- 


cients, and use a combinatorial argument to establish it. 


. Compute the number of r-letter sequences that can be formed by using the 


letters in each location below, for each given value of r. Ignore differences 
in case. 


(a) Bug Tussle, TX: r = 3,r =4,r = 11. 

(b) Cooleemee, NC: r = 4,r = 10, r = 11. 
(c) Oconomowoc, WI: r = 4, r = 11,r = 12. 
(d) Unalaska, Alaska: r= 3,r = 4,r = 14. 
(e) Walla Walla, WA: r = 4,r = 5,r = 12. 


Certainly there are more four-letter sequences that can be formed by using 
the letters in Bobo, Mississippi, than can be formed by using the letters in 
Soso, Mississippi. Is the difference more or less than the distance between 
these two cities in miles, which is 267? 


A band of combinatorial ichthyologists asserts that the number of five-letter 
sequences that can be formed using the letters of the Hawaiian long-nosed 
butterfly fish, the lauwiliwilinukunukuw oi’ 0i, is more than twice as large as 
the number of five-letter sequences that can be created using the name of 
the state fish of Hawaii, the painted triggerfish huomuhumunukunukuapua’a. 
Prove or disprove their claim by computing the exact number in each case. 


The Pigeonhole Principle 


Tam just here for anyone that’s for the pigeons. 
— Mike Tyson, Phoenix City Council meeting, June 1, 2005, 
reported in The Arizona Republic 


We now turn to a simple, but powerful, idea in combinatorial reasoning known 
as the pigeonhole principle. We can state it in the following way. 


Theorem 2.1 (Pigeonhole Principle). Let n be a positive integer. If more than 
n objects are distributed among n containers, then some container must contain 
more than one object. 
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The proof is simple—if each container held at most one object, then there 
would be at most n objects in all. 

This mathematical idea is also called the box principle (especially in number 
theory texts), which is sensible enough, since we can imagine the containers as 
boxes. In German, it is the drawer principle, logically enough, after Dirichlet’s 
original term, the Schubfachprinzip. It may seem odd to think of our containers 
as pigeon roosts, but the name probably originally referred to the “pigeonholes” 
one sees in those old desks with lots of square nooks for squirreling away papers. 
(One imagines however that the origins of the term may be the subject of some, 
well, squabbling. ...) So while the traditional name may be somewhat antiquated, 
at least the avian nomenclature saves us from talking about Dirichlet’s drawers. 

The pigeonhole principle is very useful in establishing the existence of a partic- 
ular configuration or combination in many mathematical contexts. We begin with 
a few simple examples. 


1. Suppose 400 freshmen enroll in introductory calculus one term. Then two 
must have the same birthday. Here, the pigeonholes are calendar days, so 
n = 366. 


2. In honor of champion pugilist (and pigeon enthusiast) Mike Tyson, suppose 
that n boxers schedule a round-robin tournament, so each fighter meets 
every other in a bout, and afterwards no contestant is undefeated. Then 
each boxer has between 1 and n — 1 wins, so two boxers must have the 
same record in the tournament. 


3. It is estimated that the average full head of hair has 100000 to 150000 
strands of hair. Let’s assume that the most hirsute among us has less than 
250 000 strands of hair on their head. The city of Phoenix has over 1.5 
million residents, so it follows that there must be at least two residents with 
exactly the same number of hairs on their head. Moreover, since only a 
fraction of the population is bald, the statement surely remains true if we 
exclude those with no hair at all. (Sorry, Iron Mike.) 


In this last problem, we can in fact conclude considerably more. The population 
of Phoenix is more than six times the maximum number of hairs per head, and 
a moment’s thought reveals that there must in fact exist at least six people in 
Phoenix with identical hair counts. We can thus state a more powerful pigeonhole 
principle. 


Theorem 2.2 (Generalized Pigeonhole Principle). Let m and n be positive inte- 
gers. If more than mn objects are distributed among n containers, then at least 
one container must contain at least m+ 1 objects. 


The proof is again easy—if each container held at most ™m objects then the 
total number of objects would be at most mn. An alternative formulation of this 
statement appears in the exercises. Next, we establish the following arithmetic 
variation on the pigeonhole principle. 
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Theorem 2.3. Suppose aj, a2, ..., Gn is a sequence of real numbers with mean 
Ll, SO = (a, +--+ + a,,)/n. Then there exist integers i and j, with1 <i,j <n, 
such that a; < wand a; > pL. 


The proof is again straightforward—if every element of the sequence were 
strictly greater than ju, then we would have a; + --- + Gn > np, a contradic- 
tion. Thus, an integer 27 must exist with a; < ys. A similar argument establishes 
the existence of 7. 

While the pigeonhole principle and the variations we describe here are all quite 
simple to state and verify, this idea plays a central role in establishing many de- 
cidedly nontrivial statements in mathematics. We conclude this section with two 
examples. 


Monotonic Subsequences 


We say a sequence @1, ..., Gy is increasing if ay < ag < +++ < Gyn, and Strictly 
increasing if aj < ag <-++ < Gn. We define decreasing and strictly decreasing 
in the same way. Consider first an arrangement of the integers between | and 10, 
for example, 

3, 5,8, 10,6, 1,9, 2,7, 4. (2.23) 


Scan the list for an increasing subsequence of maximal length. Above, we find 
(3,5, 8, 10), (3,5, 8,9), (3, 5,6, 7), and (3, 5, 6, 9) all qualify with length 4. Next, 
scan the list for a decreasing subsequence of maximal length. Here, the best we 
can do is length 3, achieved by (8, 6,1), (8,6,2), (8,6, 4), (10, 6,2), (10, 6,4), 
(10, 7, 4), and (9, 7, 4). Is it possible to find an arrangement of the integers from 1 
to 10 that simultaneously avoids both an increasing subsequence of length 4 and a 
decreasing subsequence of length 4? The following theorem asserts that this is not 
possible. Its statement dates to an early and influential paper of Erdés and Szek- 
eres [94], the same one cited in Section 1.8 for its contribution to the development 
of Ramsey theory. 


Theorem 2.4. Suppose m and n are positive integers. A sequence of more than 
mn real numbers must contain either an increasing subsequence of length at least 
m + 1, ora strictly decreasing subsequence of length at least n + 1. 


Proof. Suppose that 71, r2, ..., Pmn-+1 1S a sequence of real numbers which con- 
tains neither an increasing subsequence of length m + 1, nor a strictly decreasing 
subsequence of length n + 1. For each integer 7 with 1 < 1 < mn + 1, let a; de- 
note the length of the longest increasing subsequence in this sequence of numbers 
whose first term is 7;, and let d; denote the length of the longest strictly decreas- 
ing subsequence beginning with this term. For example, for the sequence (2.23) 
we see that a2 = 3 (for 5,8, 10 or 5,8,9), and dz = 2 (for 5,1 or 5,2). By our 
hypothesis, we know that 1 < a; < mand1 < d; < n for each 7, and thus 
there are only mn different possible values for the ordered pair (a;, d;). However, 
there are mn + 1 such ordered pairs, so by the pigeonhole principle there exist 
two integers 7 and k with 7 < k such that a; = ax, and d; = dx. Denote this pair 
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by (a, 6), so a = aj = ay and 6 = d; = dy. Now let rg, rin, ..., Ti, denote a 
maximal increasing subsequence beginning with ry, and let rz, rj,,..., ri, denote 
a maximal strictly decreasing subsequence beginning with this term. If r; < rz, 
then 7;, Tk, Tig, --+» Ti, 1S an increasing subsequence of length a + 1 beginning 
with r;. On the other hand, ifr; > rz, then rj, rz, Tits cees Tit is a strictly de- 
creasing subsequence of length 6 + 1 beginning with r;. In either case, we reach 
a contradiction. O 


Of course, we can replace “increasing” with “strictly increasing” and simulta- 
neously “strictly decreasing” with “decreasing” in this statement. 


Approximating Irrationals by Rationals 


Let a be an irrational number. Since every real interval [a, 6] with a < b contains 
infinitely many rational numbers, certainly there exist rational numbers arbitrarily 
close to a. Suppose however we restrict the rationals we may select to the set of 
fractions with bounded denominator. How closely can we approximate a now? 
More specifically, given an irrational number a and a positive integer Q, does 


there exist a rational number p/q with 1 < q < Q and la _ 2| especially small? 


How small can we guarantee? 

At first glance, if we select a random denominator q in the range [1, Q], then 
certainly a lies in some interval Ee ae for some integer k, so its distance to 
the nearest multiple of 1/q is at most 1/2¢. We might therefore expect that on 
average we would observe a distance of about 1/4q, for randomly selected gq. In 
view of Theorem 2.3, we might then expect that approximations with distance at 
most 1/4q must exist. In fact, however, we can establish a much stronger result 
by using the pigeonhole principle. The following important theorem is due to 
Dirichlet and his Schubfachprinzip. 

We first require some notation. For a real number 2, let || denote the floor of 
x, or integer part of x. It is defined to be the largest integer m satisfying m < x. 
Similarly, the ceiling of x, denoted by [], is the smallest integer m satisfying x < 
m. Last, the fractional part of x, denoted by {x}, is defined by {x} = x — |x]. 
For example, for « = 7 we have || = 3, [7] = 4, and {7} = 0.14159...; for 
x = 1 we obtain [1] = [1] = 1 and {1} =0. 


Theorem 2.5 (Dirichlet’s Approximation Theorem). Suppose a is an irrational 
real number, and Q is a positive integer. Then there exists a rational number p/q 
with 1 <q < Q satisfying 


p at 


a—2| <GQ+) 


Proof. Divide the real interval [0, 1] into Q + 1 subintervals of equal length: 


on) laa) ate) le 
GA) lon ea) “loa ea) lea | 
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Since each of the @ + 2 real numbers 


0, {a}, {2a},...,{Qa},1 (2.24) 


lies in (0, 1], by the pigeonhole principle at least two of them must lie in the same 
subinterval. Each of the numbers in (2.24) can be written in a unique way as ra—s 
with r and s integers and 0 < r < Q, so it follows that there exist integers r1, r2, 
81, and so, with O < 71,72 < Q, such that 


1 


\(r2a@ — 82) — (r1a@— 81)| < Oq1' 


Since only 0 and 1 in our list have the same r-value, we can assume that r; 4 ro, 
so suppose r) < rg. Letg = rg — 11,801 <q < Q, and let p = s2 — 51. Then p 
and q satisfy 


1 
a—p|< ——, 
|qa — p| Onl 
and the conclusion follows upon dividing through by gq. oO 


Since g < Q, we immediately obtain that the rational number p/q guaranteed 
by the theorem satisfies 


(2.25) 


1 
q gq +g 
Exercise 11 asks you to show that there exist infinitely many rational numbers 


p/q that satisfy this inequality for a fixed irrational number a. 


Exercises 


1. Show that at any party with at least two people, there must exist at least 
two people in the group who know the same number of other guests at the 
party. Assume that each pair of people at the party are either mutual friends 
or mutual strangers. 


2. Prove the following version of the pigeonhole principle. Let m and n be 
positive integers. If m objects are distributed in some way among n con- 
tainers, then at least one container must hold at least 1 + |(m—1)/n| 
objects. 


3. Prove the following more general version of the pigeonhole principle. Sup- 
pose that m1, m2, ..., Mn are all positive integers, let MZ = m, + m2 + 
-++-+ mM, — n+ 1, and suppose each of n containers is labeled with an 
integer between 1 and n. Prove that if M/ objects are distributed in some 
way among the n containers, then there exists an integer 7 between 1 and n 
such that the container labeled with 7 contains at least m,; objects. 


4. The top four pitchers on a college baseball team combine for 297 strikeouts 
over the course of a season. If each pitcher had at least 40 strikeouts over the 


10. 


11. 


12. 
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course of the season, and the fourth-best pitcher had less than 50 strikeouts, 
how many strikeouts could the best pitcher have made over the season? 
Your answer should be a range of possible numbers. 


Find the smallest value of m so that the following statement is valid: Any 
collection of m distinct positive integers must contain at least two numbers 
whose sum or difference is a multiple of 10. Prove that your value is best 
possible. 


Suppose A = (a1, d2,...,@n) is a sequence of positive real numbers. Let 
H(A) denote the harmonic mean of A, defined by 


H(A) =n (>: 1) 


Show there exist integers 7 and 7, with 1 < 7,7 < n, satisfying 


ay < H(A) < Qj. 


Suppose the integers from | to n are arranged in some order around a circle, 
and let & be an integer with 1 < k < n. Show that there must exist a 
sequence of k; adjacent numbers in the arrangement whose sum is at least 
[k(n + 1)/2]. 


Suppose the integers from | to n are arranged in some order around a circle, 
and let & be an integer with 1 < k < n. Show that there must exist a 
sequence of & adjacent numbers in the arrangement whose product is at 
least [(n!)*/"]. 


Let n be a positive integer. Exhibit an arrangement of the integers between 1 
and n? which has no increasing or decreasing subsequence of length n + 1. 


Let m and n be positive integers. Exhibit an arrangement of the integers 
between | and mn which has no increasing subsequence of length m + 1, 
and no decreasing subsequence of length n + 1. 


Let a be an irrational number. Prove that there exist infinitely many rational 
numbers p/q satisfying (2.25). 


Let n be a positive integer, and let b > 2 be an integer. 


(a) Show that there exists a nonzero multiple N of n whose base-b rep- 
resentation consists entirely of Os and 1s. (No partial credit will be 
awarded for the case b = 2!) Hint: Consider the sequence of numbers 
sy b for a number of values of k. 

(b) Show that there exists a multiple N of n whose base-b representation 


consists entirely of 1s if and only if no prime number p which divides 
bis a factor of n. 
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(c) Suppose that the greatest common divisor of b and n is 1, and let 
d,, ..., dm be a sequence of integers with O < d; < b for each i 
and d, # 0. Show that there exists a multiple N of n whose base- 
b representation is obtained by juxtaposing some integral number of 
copies of the base-b digit sequence d dz --- dy. 


13. Let a1, a2,..., Gn be a sequence of integers. Show that there exist integers 
j and & with 1 < 7 < k < n such that the sum ae a; is a multiple of n. 


14. (Bloch and Pélya [28].) For a positive integer d, let Ng denote the set of 
polynomials with degree at most d— 1 whose coefficients are all 0 or 1. For 
example, N3 = {0,1,2,27,l+2,l1+27,7+27,l+24+ 27}. 


(a) Let f‘*)(x) denote the kth derivative of f(x). Show that if f € Na 
then f*-Y (1) < d&/k. 

(b) Let m be a positive integer. Determine an upper bound on the number 
of different possible m-tuples (f(1), f’(1),..., f©"~)(1)) achieved 
by polynomials f(x) € Na. 


(c) Prove that if d > 1 and 
d s m+ 
log, d 2 J? 


then there exists a polynomial h(a) of degree at most d — 1 whose 
coefficients are all 0, 1, or —1, and which is divisible by (a — 1)”. 


2.5 The Principle of Inclusion and Exclusion 


What we here have to do is to conceive, and invent a notation for, all 
the possible combinations which any number of class terms can 
yield; and then find some mode of symbolic expression which shall 
indicate which of these various compartments are empty or 
occupied ... 

— John Venn, [275, p. 23] 


Suppose there are 50 beads in a drawer: 25 are glass, 30 are red, 20 are spherical, 
18 are red glass, 12 are glass spheres, 15 are red spheres, and 8 are red glass 
spheres. How many beads are neither red, nor glass, nor spheres? 

We can answer this question by organizing all of this information using a Venn 
diagram with three overlapping sets: G for glass beads, R for red beads, and S for 
spherical beads. See Figure 2.2. We are given that there are eight red glass spheres, 
so start by labeling the common intersection of the sets G, R, and S in the diagram 
with 8. Then the region just above this one must have ten elements, since there 
are 18 red glass beads, and exactly eight of these are spherical. Continuing in this 
way, we determine the size of each of the sets represented in the diagram, and we 
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conclude that there are exactly twelve beads in the drawer that are neither red, nor 
glass, nor spheres. 


FIGURE 2.2. A solution using a Venn diagram. 


Alternatively, we can answer this question by determining the size of the set 
GURUS (does this make us counting GURUS?). Summing the number of 
elements in the sets G, R, and S produces a number that is too large, since this 
sum counts the beads that are in more than one of these sets at least twice. We 
can try to compensate by subtracting the number of elements in the sets GN R, 
GS, and ROS from the sum. This produces a total that is too small, since 
the beads that have all three attributes are counted three times in the first step, 
then subtracted three times in the second step. Thus, we must add the number of 
elements in GM RMS to the sum, and we find that 


IGURUS|=|G| +|R| + |S|-|Gn R|-|GnS|-|RNS|+|GN RNS]. 


Letting No denote the number of beads with none of the three attributes, we then 
compute 


No = 50-|GURUS| 
= 50- |G|—|R| - |S} +|GN R|+|GNS|4+|ROS|-|GNRNS| 
= 50 — 25 — 30— 204+ 18+ 124 15-8 
= 12. 


This suggests a general technique for solving some similar combinatorial prob- 
lems. Suppose we have a collection of N distinct objects, and each object may 
satisfy one or more properties that we label a1, ag, ..., a. Let N(a;) denote 
the number of objects having property a;, let N(a;a;) signify the number having 
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both property a; and property a,, and in general let N(a;,a;, ...a@;,,) represent 
the number satisfying the m properties a;,,..., a;,,. Let No denote the number 
of objects having none of the properties. We prove the following theorem. 


Theorem 2.6 (Principle of Inclusion and Exclusion). Using the notation above, 


No = N—S 7 N(ai) + 5° N(aias) = » N(ajajan) +--- 
i i<j i<j<k 
+(-1)" So N(ai,...04,,) + +++ (-1)"N(a1a2... ar). 
41 <-9+<4m 


(2.26) 


Proof. Suppose an object satisfies none of the properties. Then the expression on 
the right side counts it precisely once, in the NV term. On the other hand, suppose 
an object satisfies precisely m of the properties, with m a positive number. Then 
it is counted once in the NV term, m times in the }> N(a;) term, (2) times in 
the second sum, and in general ( times in the kth sum. Therefore, the total 
contribution on the right side from this object is 


pea) 


k 
by (2.9). This completes the proof. O 


We consider four applications of this counting principle. 


The Euler vy Function 


Two integers are said to be relatively prime if their greatest common divisor is 1. 
If n is a positive integer, let y(n) be the number of positive integers m < n that 
are relatively prime to n. This function, called the Euler y function or the Euler 
totient function, is important in number theory. We can derive a formula for this 
function by using the principle of inclusion and exclusion. 

We must name a set and list a number of properties such that the number of 
elements in the set satisfying none of the properties is y(n). Suppose n is divisible 
by precisely r different primes, which we label p, through p,.. Select {1,2,...,n} 
as the set, and let a; be the property “is divisible by p;.” Then No = y(n), and 
it is easy to compute the terms on the right side of the equation in Theorem 2.6: 
N =n, N(ai) = n/pi, N(aia;) = n/(pip;), and so on. Therefore, 


On a 


rae iG PiPj i<j<k PiPjPk P1p2°** Pr 


Exercise 7 asks you to verify the last step. 
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For example, the primes dividing 24 are 2 and 3, so y(24) = 24(1—3)(1-$) = 
8. The eight numbers between | and 24 that are relatively prime to 24 are 1, 5, 7, 
11, 13,17, 19, and 23. 


Counting Prime Numbers 


Suppose m is a composite positive integer, so m can be written as a product of 
two integers that are both greater than 1: m = ab with 1 < a < b. Thena? < m, 
so a < \/m, and so m must be divisible by a prime number p with p < ./m. 

We can use this observation, together with Theorem 2.6, to count the prime 
numbers between 1 and a given positive integer n. We start with the set of integers 
{1,2,...,m}, and use the theorem to count the number of elements that remain 
when multiples of prime numbers p < \/n are excluded from the set. Since every 
composite number m < n has a prime factor p < \/m, excluding all of these 
numbers removes all the composite numbers from the set. 

For example, for n = 120, the largest prime less than or equal to \/n is the 
fourth prime number, 7, so we require just four properties in Theorem 2.6 to 
exclude all the composite numbers in the set {1,2,..., 120}. The four properties 
are a, = “is even,” a2 = “is divisible by 3,” a3 = “is divisible by 5,” and a4 = 
“4s divisible by 7.” We compute N(a,) = 120/2 = 60, N(a2) = 120/3 = 40, 
N(a3) = 120/5 = 24, and N(a4) = |[120/7| = 24. (The quantity |a|was 
defined on page 153.) 

Continuing our calculation, we compute N(a;a2) = |120/6| = 20, then 
N(a a3) = [120/10] = 12, etc., and find that No = 120— (604+ 40+ 24+17)+ 
(20+ 12+8+8+5+4+3)—(4+2+1+1)+0 = 27. But this is not the number 
of prime numbers between | and 120, for our method excludes the primes 2, 3, 5, 
and 7, and includes the nonprime 1. Accounting for these exceptions, we find that 
the number of primes between | and 120 is No + 4 — 1 = 30. 


Chromatic Polynomials 


Let G be a graph. Recall that its chromatic polynomial cg(a) measures the num- 
ber of ways to color the vertices of G using at most x colors in such a way that no 
two vertices connected by an edge have the same color. We can use Theorem 2.6 
to compute chromatic polynomials. 

Suppose G has n vertices, and consider the set of colorings of the vertices of G 
using at most z colors, so the number of colorings in this set is N = x”. To find 
cq(a), we must exclude all of the inadmissible colorings from this set. For each 
edge e; in the graph, select property a; to be “edge e; connects two vertices that 
have the same color.” In this way, the colorings in the set that satisfy none of the 
properties are precisely the admissible colorings, so No = cg(z). 

For example, we compute the chromatic polynomial for the complete graph Iv3 
using this strategy. This graph has three edges, so we take r = 3 in the theorem. 
We compute N(a;) = N(az) = N(a3) = x, since every coloring satisfying 
one of the properties has two vertices with the same color, and the third vertex 
may be any color. Also, N(a1a2) = N(a2a3) = N(aia3) = N(a,a2a3) = x, as 
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every coloring satisfying more than one property must have all vertices colored 
identically. Thus, cx,(z) = No = 2° — 3x22 + 382 —2 = x(x — 1)(x — 2) = 3. 


Derangements 


Suppose a lazy professor gives a quiz to a class of n students, then collects the 
papers, shuffles them, and redistributes them randomly to the class for grading. 
The professor would prefer that no student receives his or her own paper to grade. 
What is the probability that this occurs? Is this probability substantially different 
for different class sizes? What do you think the limiting probability is as n — oo? 
Notice that as n grows larger, there are more ways for at least one person to receive 
his or her own quiz back, but perhaps this increase is swamped by the growth of 
the total number of permutations possible. 

Suppose we have n objects in an initial configuration. A permutation of these 
objects in which the position of each object differs from its initial position is called 
a derangement of the objects. Since n! denotes the number of permutations of n 
objects, following [133] we denote the number of derangements of n objects by 
nj (and since n! is often pronounced “‘n bang,” perhaps nj should be pronounced 
“n gnab’). 

We compute nj for some small values of n. For n = 0, there is just one per- 
mutation, and it vacuously satisfies the derangement condition, so 0; = 1. There 
is only one permutation of a single object, and it is not a derangement, so 1j = 0. 
Only one of the two permutations of two objects is a derangement, so 2; = 1, and 
exactly two of the six permutations of three objects satisfies the condition: If our 
original arrangement is [1, 2,3], then the derangements are [2,3, 1] and [3, 1, 2]. 
Thus 3; = 2. We find that 4; = 9: The derangements of [1, 2, 3, 4] are [2,1, 4, 3], 
(2, 3,4, 1], [2, 4, 1, 3], [3, 1,4, 2], [3, 4, 1, 2], [3, 4, 2, 1], [4, 1, 2, 3], [4,3, 1, 2], and 
[4, 3,2, 1]. Thus, the probability that a random permutation of a fixed number n 
of objects is a derangement is respectively 1, 0, 4, 3, and 3 for n = 0 through 4. 

We can use Theorem 2.6 to determine a formula for nj. We select the original 
set to be the collection of all permutations of n objects, and for 1 < 7 < n let a; 
denote the property that element 2 remains in its original position in a permutation. 
Then No is the number of permutations where no elements remain in their original 
position, so No = nj. 

To compute N(a;), we see that element 7 is fixed, but the other n — 1 elements 
may be arranged arbitrarily, so N(a;) = (n — 1)!. Similarly, N(a;a;) = (n — 2)! 
fori < j, N(ajaj;ax) = (n — 3)! fori < 7 < k, and so on. Therefore, 


m=nl— > (n-Y!I+ D(n—-2I--- 


4 i<j 


ty <<im 
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Since the number of different m-tuples (71, i2,...,im) with 1 <i, < ig <-++< 


im < nis (oy we obtain 


and in the limit, 


lim 4 — a ye =e, (2.27) 


Our lazy professor obtains a desired configuration about 36.8% of the time, for 
sizable classes. 


Exercises 


1. A noted vexillologist tells you that 30 of the 50 U.S. state flags have blue 
as a background color, twelve have stripes, 26 exhibit a plant or animal, 
nine have both blue in the background and stripes, 23 have both blue in the 
background and feature a plant or animal, and three have both stripes and a 
plant or animal. One of the flags in this last category (California) does not 
have any blue in the background. How many state flags have no blue in the 
background, no stripes, and no plant or animal featured? 


2. Suppose 50 socks lie in a drawer. Each one is either white or black, ankle- 
high or knee-high, and either has a hole or doesn’t. 22 socks are white, four 
of these have a hole, and one of these four is knee-high. Ten white socks are 
knee-high, ten black socks are knee-high, and five knee-high socks have a 
hole. Exactly three ankle-high socks have a hole. 


(a) Use Theorem 2.6 to determine the number of black, ankle-high socks 
with no holes. 


(b) Draw a Venn diagram that shows the number of socks with each com- 
bination of characteristics. 
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3. The buffet line at a local steakhouse has 35 dishes. Sixteen dishes contain 


meat, fourteen dishes are fried, and of the dishes with meat, eight contain 
vegetables and seven are fried. Of the fried dishes, five contain a vegetable. 
Just two dishes are fried and contain both meat and a vegetable, and ten 
dishes (principally in the dessert section) contain neither meat nor a veg- 
etable and are not fried. Use Theorem 2.6 to determine how many dishes 
contain vegetables. 


. A sneaky registrar reports the following information about a group of 400 


students. There are 180 taking a math class, 200 taking an English class, 
160 taking a biology class, and 250 in a foreign language class. 80 are 
enrolled in both math and English, 90 in math and biology, 120 in math and 
a foreign language, 70 in English and biology, 140 in English and a foreign 
language, and 60 in biology and a foreign language. Also, there are 25 in 
math, English, and a foreign language, 30 in math, English, and biology, 40 
in math, biology, and a foreign language, and fifteen in English, biology, 
and a foreign language. Finally, the sum of the number of students with a 
course in all four subjects, plus the number of students with a course in none 
of the four subjects, is 100. Use Theorem 2.6 to determine the number of 
students that are enrolled in all four subjects simultaneously: math, biology, 
English, and a foreign language. 


. On a busy evening a number of guests visit a gourmet restaurant, and ev- 


eryone orders something. 140 guests order a beverage, 190 order an entree, 
100 order an appetizer, 90 order a dessert, 65 order a beverage and an appe- 
tizer, 125 order a beverage and an entree, 60 order a beverage and a dessert, 
85 order an entree and an appetizer, 75 order an entree and a dessert, 60 or- 
der an appetizer and a dessert, 40 order a beverage, appetizer, and dessert, 
55 order a beverage, entree, and dessert, 45 order an appetizer, entree, and 
dessert, 35 order a beverage, entree, and appetizer, and ten order all four 
types of items. Use Theorem 2.6 to determine the number of guests who 
visited the restaurant that evening. 


. Use Theorem 2.6 to determine the number of five-card hands drawn from a 


standard deck that contain at least one card from each of the four suits. 


7. Let a1, @2,..., @, be real numbers. Show that 


8. 


Tr 


[[ @ - a) =1- Sait So aia; - Ss Qj, AjAR +> 


i=1 i<j i<j<k 


+ (-1)"ajag:++ ap. 


(a) Show that y(mn) = y(m)y(n) if m and n are relatively prime. 


(b) Show that y(mn) 4 v(m) y(n) if m and n are not relatively prime. 
Is one quantity always larger than the other in this case? 
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(c) Determine all integers n satisfying y(n) = 12, 13, or 14. 
9. Use Theorem 2.6 to count the number of prime numbers less than 168. 


10. Use Theorem 2.6 to determine the chromatic polynomial for each of the 
following graphs. 


(a) The yield sign (add a single edge to the bipartite graph 1,3). 
(b) The bipartite graph K2 3. 


11. Is the probability that a permutation of n objects is a derangement substan- 
tially different form = 12 and n = 120? Quantify your answer. 


12. (Deranged twins.) Suppose n + 2 people are seated behind a long table fac- 
ing an audience to staff a panel discussion. Two of the people are identical 
twins, wearing identical clothing. At intermission, the panelists decide to 
rearrange themselves so that it will be apparent to the audience that every- 
one has moved to a different seat when the panel reconvenes. Each twin can 
therefore take neither her own former place, nor her twin’s. Let T;, denote 
the number of different ways to derange the panel in this way. 


(a) Compute To, 7), T2, and T3. 
(b) Compute 74. 


(c) Determine a formula for T;,, and check that your formula produces 
Tio = 72755 370. 


d) Compute the value of lim ——*—. 

13. Suppose our lazy professor collects a quiz and a homework assignment 
from a class of n students one day, then distributes both the quizzes and the 
homework assignments back to the class in a random fashion for grading. 
Each student receives one quiz and one homework assignment to grade. 


(a) What is the probability that every student receives someone else’s quiz 
to grade, and someone else’s homework to grade? 


(b) What is the probability that no student receives both their own quiz 
and their own homework assignment to grade? In this case, some stu- 
dents may receive their own quiz, and others may receive their own 
homework assignment. 


(c) Compute the limiting probability as m — oo in each case. 


14. Let N,, denote the number of objects from a collection of N objects that 
possess exactly m of the properties a1, a2, ..., a,. Generalize the principle 
of inclusion and exclusion by showing that 


" k 
Nn = ¥ (1 (*) Sk; (2.28) 


k=m 
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where 


a= Yo NG ee): (2.29) 


i1<- <p 


2.6 Generating Functions 


And own no other function: each your doing, 
So singular in each particular, 
Crowns what you are doing in the present deed, 
That all your acts are queens. 
— William Shakespeare, The Winter’s Tale, Act IV, Scene IV 


Given a sequence {a;,} with k > 0, its generating function G(x) is defined by 


Ge)= >" axe. (2.30) 


k>0 


Thus, G(x) is a polynomial if {a;,} is a finite sequence, and a power series if {a;,} 
is infinite. For example, if a, = (—1)*/k!, then G(z) is the Maclaurin series for 
e~”, and if a, = (7) for a fixed nonnegative integer n, then G(x) = (1+ 2)”, by 
the binomial theorem. 

To illustrate how generating functions can be used to solve combinatorial prob- 
lems, let us consider again the problem of determining the number of k-element 
subsets of an n-element set. Fix n, and let az denote this number. Of course, we 
showed in Section 2.1 that a, = n*/k!; here we derive this formula again using 
generating functions. 

Suppose we wish to enumerate all subsets of the n-element set. To construct 
one subset, we must pick which elements to include in the subset and which to 
exclude. Let us denote the choice to omit an element by x°, and the choice to 
include it by x. Using “++” to represent “or,” the choice to include or exclude one 
element then is denoted by x° + x'. We must make n such choices to construct a 
subset, so using multiplication to denote “and,” the expression (x° + 21)” models 
the choices required to make a subset. 

Since “and” distributes over “or” just as multiplication distributes over addi- 
tion, we may expand this expression using standard rules of arithmetic to obtain 
representations for all 2” subsets. For example, when n = 3, we obtain 

(a + x)° = 0° 2° ¢ + ooo! + ote? + 2° ote} 
+a o°s® + a eat 4+ eee? + 21 22!, 
The first term represents the empty subset, the second signifies the subset contain- 
ing just the third item in the original set, etc. Writing 1 for x° and x for x and 
treating the expression as a polynomial, we find that (1+2)° = 1+32+32?+23, 
and the coefficient of x" is the number of subsets of a three-element set having 
exactly k items. 
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In general, we find that the generating function for the sequence {a,} is (1 + 
x)", 80 ap = (7) = n*/k!, by the binomial theorem. Since our proof of the 
binomial theorem relies only on basic facts of arithmetic, this argument provides 
an independent derivation for the number of k-element subsets of a set with n 
elements. 

This example illustrates the general strategy for using generating functions to 
solve combinatorial problems. First, express the problem in terms of determining 
one or more values of an unknown sequence {a;,}. Second, determine a generat- 
ing function for this sequence, writing the monomial «:* to represent selecting an 
object k times, then using addition to represent alternative choices and multipli- 
cation to represent sequential choices. Third, use analytic methods to expand the 
generating function and determine the values of the encoded sequence. 

For example, suppose a drawer contains twelve beads: three red, four blue, 
and five green, and suppose we wish to determine the number of ways to select 
six beads from a drawer, if beads of the same color are indistinguishable, and the 
order of selection is irrelevant. Let a;, denote the number of ways to select k beads 
from the drawer. Then the generating function for this sequence is 


G(z) = (1+e+2%+25)\(1+e4+2? +2°+ 2%) 
-It+et+e%4+e34+24+2°) 
= 1432+ 62? + 102? + 1427 + 172° + 182° 
+170 + 142° + 102" + 62 +324 +2. 


For example, we see from this that there are exactly ag = 18 ways to select 
six beads from the drawer. Indeed, we can check this by constructing all such 
selections: 


rrrggg, rrrggb, rrrgbb, rrrbbb, rrgggg, rrgggb, 
rrggbb, rrgbbb, rrbbbb, rggggb, rgggbb, rggbbb, (2.31) 
rgbbbb, rbbbbb, ggggbb, gggbbb, ggbbbb,  gbbbbb. 


In the following sections, we explore the power of this method by studying 
several combinatorial problems. 


Exercises 


1. In this problem, we verify that the arithmetic operations performed in gen- 
erating functions model the logical selections made in combinatorial prob- 
lems. Write a” to denote selecting k copies of object a. 


(a) Clearly, there are exactly four different subsets of the set {a,b}. We 
can model the construction of the different possible subsets of this 
two-element set by considering two choices: Pick a or not, and then 
pick b or not. Thus, we can denote all the possible choices by writing: 
(a° or a') and (0° or b+). Expand this expression using the logical rule 
“(P or Q) and R = (P and R) or (Q and R)”. Continue expanding 
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until you obtain an expression of the form “C’, or C2 or C3 or C4,” 
where each C; is a logical expression involving only and’s. 


(b) Rewrite exactly the same logical computation, but now use ° in place 
of a° or b°, x! in place of a! or b!, + instead of “or’, and « instead of 
“and”. Then simplify the expression by combining exponents in the 
usual way. 


(c) Repeat this procedure for the three-element set {a, b, c}. 


(d) Repeat this procedure for sub-multisets of the three-element multiset 


{a, a, b}. 


2. Suppose a drawer contains three red beads, four blue beads, and five green 
beads. Use a generating function to determine the number of ways to select 
six beads if one must select at least one red bead, an odd number of blue 
beads, and an even number of green beads. Then check your answer using 
the combinations shown in (2.31). Assume that beads of the same color are 
indistinguishable, and that the order of selection is irrelevant. 


3. Suppose a drawer contains ten red beads, eight blue beads, and eleven green 
beads. Determine a generating function that encodes the answer to each of 
the following problems. 


(a) The number of ways to select & beads from the drawer. 


(b) The number of ways to select & beads if one must obtain an even num- 
ber of red beads, an odd number of blue beads, and a prime number 
of green beads. 


(c) The number of ways to select & beads if one must obtain exactly two 
red beads, at least five blue beads, and at most four green beads. 


2.6.1 Double Decks 


I don’t like the games you play, professor. 
— Roger Thornhill, in North by Northwest 


How many five-card poker hands can be dealt from a double deck? Assume that 
the two decks are identical. More generally, how many ways are there to select m 
items from n different items, where each item can be selected at most twice? Let 
us denote this number by ty, and let G,() be the generating function for the 
sequence {tym} form > 0 and n fixed. 

We find that G,, (a) = (1 +uU+ a); since each object may be selected zero 
times, one time, or two times. To find t,,,., we must determine a formula for 
the coefficient of x” in G,,(a). This is simply a matter of applying the binomial 
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theorem twice: 


E00 


Gn lee) 


where we obtained the last line by substituting m for 7 + k. Therefore, 


mi \ Im 5 
m= eT) 


The number of five-card poker hands that can be dealt from a double deck is 


(290) (36) C)G) 20 


There is a simple combinatorial explanation for this expression. A five-card hand 
dealt from a double deck may have zero, one, or two cards repeated. There are 
(°?) hands with no cards repeated, (*”) (t) hands with exactly one card repeated, 
and (°7) (3) hands with exactly two cards repeated. A similar explanation applies 


for the general formula (2.32). 


Exercises 


1. Derive (2.32) by using the multinomial theorem to expand G', (a). 


2. Use a combinatorial argument to count the number of different five-card 
hands that can be dealt from a triple deck, then the number of five-card 
hands that can be dealt from a quadruple deck. 


3. Use a combinatorial argument to count the number of different six-card 
hands that can be dealt from r combined decks, for each positive integer r. 


4. Use a generating function to determine the number of ways to select a hand 
of m cards from a triple deck, if there are n distinct cards in a single deck. 
Verify that your expression produces the correct answers when n = 52 and 
m=d5orm=6. 
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2.6.2 Counting with Repetition 


Then, shalt thou count to three, no more, no less. Three shalt be the 
number thou shalt count, and the number of the counting shall be 
three. Four shalt thou not count, nor either count thou two, 
excepting that thou then proceed to three. Five is right out. 

— Monty Python and the Holy Grail 


Suppose there is an inexhaustible supply of each of n different objects. How many 
ways are there to select m objects from the n different objects, if you are allowed 
to select each object as many times as you like? 

Let apm denote this number. Evidently, for fixed n, the generating function for 
{n,m }m>0 is 


Gu(a) = (It at a? t+)" = ( : ie 


1-2 


since the sum is just a geometric series in x. This raises questions on convergence, 
for this formula is valid only for || < 1. We largely ignore these analytic issues, 
since we treat generating functions as formal series. 
Thus, to find a formula for ay,,, we must find the coefficient of a” in G,, (x). 
Let us consider a more general problem. Let f(x) = (1 + x)°, where a is a 
real number. Then f’(0) = a, f”(0) = a(a — 1), and in general, f‘*)(0) = a&. 
Therefore, the Maclaurin series for f(a) is 


k 
ge = 5 Sak 
(l+2)*= ae 


k>0 


Define the generalized binomial coefficient by 


K/k\ ifk> 
a\ _ ja /k Ree, (2.33) 
k 0 ifk <0. 


Note that G) equals the ordinary binomial coefficient whenever a is a nonnega- 
tive integer. We have the following theorem. 


Theorem 2.7 (Generalized Binomial Theorem). [f|x| < 1 or a is a nonnegative 


integer, then 
a 
1 o ‘ 2.34 
hrs > ( > s (2.34) 
k 

The proof of convergence may be found in many analysis texts, where it is often 
proved as a consequence of Bernstein’s theorem on convergence of Taylor series 
(see for instance [11]). We do not supply the proof here. 

Before solving our problem concerning selection with unlimited repetition, we 
note a useful identity for generalized binomial coefficients. 
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Negating the Upper Index. /f a is a real number and k is an integer, then 


({) = cae(F oo '). (2.35) 


Proof. For k < 0, the identity is clear. For k > 0, we have 
2 1 &a 
* i=0 


Reindex this product, replacing each 7 by k — 1 — 2, to obtain 


We may now solve our problem of determining apm. We compute 


Gn(z) = (1-2) 


and therefore the number of ways to select m objects from a collection of n dif- 
ferent objects, with repetition allowed, is 


-—1 
anym = 4 ee ) : (2.36) 


m 


For example, the number of five-card poker hands that can be dealt from a stack 
of five or more decks is es = 3819816. 

Finally, suppose we lay all 52 cards of a standard deck face up on a table. How 
many ways can we place five identical poker chips on the cards if we allow more 
than one chip to be placed on each card? To solve this, notice that each possible 
placement of chips corresponds to a hand of five cards, where repeated cards are 
allowed: If k chips lie on a particular card, place that card into the hand k times. 
Further, every such five-card hand can be represented by a judicious placement of 
chips. Therefore, the answer is the same as that of the previous example, Ck 

In general, the number of ways to place m identical objects into n distinguish- 
able bins is the same as the number of ways to select m objects from a set of n 


objects with repetition allowed: The answer to both problems is upc © 
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Exercises 


1. Prove the addition identity for generalized binomial coefficients: If a is a 
real number and & is an integer, then 


a\  fa-l a a-l 
k} k k-1) 
2. Prove the absorption/extraction identity for generalized binomial coeffi- 
cients: If @ is a real number and k is a nonzero integer, then 


a\  afa-1 
k) k\R-1)° 
3. Prove the cancellation identity for generalized binomial coefficients: If a is 
a real number and & and m are integers, then 


a\(k\ _ (a\fa-m 
k}\m} \m/\k-m/)’ 
4. Prove the parallel summation identity for generalized binomial coefficients: 
If a@ is areal number and n is an integer, then 
‘ atk\ fatn+l1 
k -_ n : 
k=0 


5. Suppose that an unlimited number of jelly beans is available in each of five 
different colors: red, green, yellow, white, and black. 


(a) How many ways are there to select twenty jelly beans? 


(b) How many ways are there to select twenty jelly beans if we must 
select at least two jelly beans of each color? 


6. Acatering company brings fifty identical hamburgers to a party with twenty 
guests. 


(a) How many ways can the hamburgers be divided among the guests, if 
none is left over? 


(b) How many ways can the hamburgers be divided among the guests, if 
every guest receives at least one hamburger, and none is left over? 


(c) Repeat these problems if there may be burgers left over. 


7. A zodiac sign is one of twelve constellations that the sun travels through 
(from the vantage point of the earth) over the course of a year. Each person 
has a zodiac sign based on the position of sun on their birth date. The astro- 
logical configuration of a party with n guests is a list of twelve numbers that 
records the number of guests with each sign, so the first number records the 
number of people with the sign Capricorn, the second, Aquarius, ..., the 
last, Sagittarius. 
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(a) How many different astrological configurations are possible for n = 
100? 


(b) How many astrological configurations are possible for n = 100, if 
each component is at least 5? 


8. Two lottery systems are proposed for a new state lottery. In the first sys- 
tem, players select six different numbers from {1,2,...,50}. In the second 
system, players select six numbers from {1,2,...,45}, and may select any 
number as many times as they want. (In the second system, each ball se- 
lected in the lottery drawing is replaced before another ball is selected.) 
Which system has more possible tickets? 


9. Suppose 100 identical tickets for rides are distributed among 40 children at 
a carnival. 


(a) How many ways can the tickets be distributed, if each child receives 
at least two tickets, and all the tickets are distributed? 


(b) How many ways can the tickets be distributed, if each child receives 
at least one ticket, and some tickets may be left over? 


(c) Suppose one child has twelve tickets, and each ticket may be used on 
any of six different rides. How many ways can the child spend her 
tickets, if she can choose any ride any number of times, and the order 
of choice is unimportant? 


2.6.3 Changing Money 


Jesus went into the temple, and began to cast out them that sold and 
bought in the temple, and overthrew the tables of the moneychangers 


— Mark 11:15 


We now turn to a problem popularized by the analyst and combinatorialist George 
Pélya [225]: How many ways are there to change a dollar? That is, how many 
combinations of pennies, nickels, dimes, quarters, half-dollars, and dollar coins 
total $1? Our discussion of this problem follows the treatment of Graham, Knuth, 
and Patashnik [133]. 

Let us define ax to be the number of ways to make k cents in change, and let 
A(a) be a generating function for ay: A(x) = >>), azz". Before analyzing this 
problem, pause a moment and make a guess. Do you think aso is more than 50 
or less than 50? Is ai99 more than 100 or less than 100? How fast do you think 
a, grows as a function of /:? Is it a polynomial in /;? Exponential in k? Perhaps 
something between these? 

To create a pile of change, we must make six choices, selecting a number 
of pennies, then nickels, then dimes, quarters, half-dollars, and dollars. We can 
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model our choice of pennies by the sum 


1 


Teepe 


lteter tar yes 


One might be tempted to use the same expression to model the different choices 
for each of the other coins, since we can pick any number of nickels, and any 
number of dimes, etc., but this would be incorrect. This would yield a generating 
function for the number of ways to select / coins from a set of six different coins, 
not the number of ways to form k cents. Instead, when choosing nickels, we select 
either zero cents, or five cents, or ten cents, and so on, so this selection is modeled 
as 


1 
l+ar tate +---= —_. 
1-2 


Therefore, the number of ways to make k cents using either pennies or nickels is 
given by the generating function 


eee ene 
(1—a)(1— 25) 


Continuing in this way, we find that 


1 
A@) = Grd FAS AaB wT — gry? (37) 


so we merely need to find the coefficient of aj, in the Maclaurin series for A()! 
This sounds rather daunting, so let us determine a few values of a;, by hand first. 

Let {p,} denote the number of ways to make k cents using only pennies, so 
pe = 1 for all k. Let P(a) be the generating function for {p;,}, so P(x) = 
1/(1 — x). Let nx be the number of ways to make k cents using either pennies or 
nickels, so its generating function is 


P(x) 


N(a) = fae 


Thus N(x) = P(x) + 2° N(z), and by equating coefficients we find that 


Dk if0<k <4, 
ie => 
Fe ne bes. RSS: 


In the same way, let d, denote the number of ways to make é cents using pennies, 
nickels, or dimes, and let D(a) be its generating function. We then have D(x) = 
N(a)+ 21° D(z), and so 


_ fing if0<k<9, 
OV Ldeag. GER > 10. 
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There is a simple combinatorial interpretation for this equation. If k < 10, then 
we can choose only nickels and pennies to form k cents, so dy, = nx in this case. 
If & > 10, we may form é; cents using only nickels and pennies, or we can choose 
one dime, then form the remaining k — 10 cents using dimes, nickels, and pennies. 
Thus dy, = nz + dz—10 in this case. 

Similarly, using q; for allowing quarters, h;, for half dollars, and finally a, for 
dollar coins, we have 


dy if0<k < 24, 
-— i + qk-25 if k > 25; 
m= {i if0<k < 49, 

Gk +hp—so if k > 50; 

fie if0<k < 99, 
— : 4: de-ton- GPRS 100. 


We may use these formulas to construct Table 2.2 below showing the number of 
ways to make k cents with the different coin sets. 


0 5 10 15 20 25 30 385 40 45 50 
lL 1 © Tt 1 1 1 1 je 1 
1 2 3 4 #5 6 8 9 10 Il 
1 2 4 6 9 12 16 20 25 30 36 
1 13 49 
1 50 
1 


55 60 65 70 75 80 85 90 95 100 
1 1 1 1 1 1 1 1 1 1 
12 13 14 15 16 17 #18 «19 20 21 


42 49 56 64 72 81 100 121 
121 242 

292 

293 


TABLE 2.2. Computing the number of ways to make k cents in change. 


We find that there are precisely 50 ways to make 50 cents in change, and 293 
ways to make one dollar in change. 

This is a fairly efficient method to determine a,, since apparently we can cal- 
culate this number using at most 5k arithmetic operations. But we can do much 
better! We can compute a; using at most a constant number of arithmetic oper- 
ations, regardless of the value of k. To show this, let us first simplify A(a) by 
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exploiting the fact that all but one of the exponents in (2.37) is a multiple of 5. Let 


(«yd — 2) — 2 (1 — e (1 — ae) 


so that 
A(z) = (1+a+2?+2° +2"*)Bi(e°). 


Writing b;, for the coefficient of x” in the Maclaurin series for B(x) and equating 
coefficients, we find that 


be = a5k = A5k41 = A5k+2 = A5k+3 = Ask+4- 


But this makes sense (or perhaps cents?), since the last few cents can be repre- 
sented only using pennies. Now 


na) = 
(1 — 220) 
where 
C(x) = (pede) Oa de®) (140% +0! 4015) 


(1+2°) 
=o! 4 98? 4d 4 Ge 400" + 13e" 4+ 182 + 24¢" 4 31¢% 
3927? + 5027) + 6227 + 772° + 930% + 1120° + 1340 
41592 4+ 18705 +2180 + 9520 4 9870%! 4. 995n°? 4 span? 
+ 4062°° + 44925” + 4932°° + 5382°5 + 584a°4 + 6312°% + 679a°? 
+ 7222°1 + 7662°° + 805279 + 845278 + 88004” + 9102** + 93524 
+ 95524 + 970243 + 98024? + 985271 + 985a*° + 98029 + 970x°8 
+ 95523" + 9352°6 + 91027 + 88024 + 845239 + 8052°? + 766034 
+ 7222°9 + 679279 + 631278 + 58427" + 538276 + 493075 + 449224 
+ 406273 + 36427? + 325271 + 287270 + 2520) + 21818 + 18721" 
+ 159216 + 134275 + 1122 4+ 93213 + 772) + 6201! + 5021° 
392° + 3108 + 24a" + 182° + 132° + 9a* + 6x? + 4a? + 2a +-1. 


We know from the previous section that 
1 n+k—-1)\ , 
a—y ape n—1 ): i 


sO 


Ba) =C(e) >, (" : °) pte (2.38) 
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Therefore, writing C(x) = 7, crax®, we have ai99 = b29 = Co (2) + C29 (2) = 
6 + 287 = 293, and 


a1900 = 6200 


_ 15 . 14 a 13 fa 12 ie 11 
=—Ol..2 ©20\ 5 C40 \ 5 C60 | 5 Ca0\ 5 
= 2103596. 


Our expression for computing a; is a sum having at most five terms, so this 
method allows us to compute a;, using only a constant number of operations. 

Finally, consider the crazy system of coinage where there is a coin minted worth 
n cents for every n > 1. Let p, denote the number of ways to make n cents in 
change in this system. For example, pa = 5, since we can make four cents by 
using four pennies, or one two-cent piece and two pennies, or one three-cent piece 
and one penny, or two two-cent pieces, or one four-cent piece. By representing 
these five possibilities as the sums 1+ 14+1+1,2+1+1,38+4+1,24 2, 
and 4, we see that p,, is the number of ways to write n as a sum of one of more 
positive integers, disregarding the order of the summands. Such a representation 
is called a partition of n. Evidently the generating function P(2:) for the sequence 
of partitions is given by the infinite product 


1 
Pia) = [| rs, (2.39) 
k>1 


We explore this generating function and the sequence {p,,} in Section 2.8.1. 


Exercises 


1. Use (2.38) to compute a2o09, the number of ways to make $20.09 in change. 


2. How many ways are there to select 100 coins from an inexhaustible supply 
of pennies, nickels, dimes, quarters, half-dollars, and dollar coins? 


3. Show that the number of ways to make 10m cents in change using only 
pennies, nickels, and dimes is (m + 1)?. 


4. Show that a; can be computed using equation (2.38) using at most 60 arith- 
metic operations. Optimize your method to show that a, can be computed 
using at most 31 arithmetic operations. 


5. Prove that a, grows like k° by showing that there exist positive constants c 
and C' such that ck° < ay < Ck? for sufficiently large k. 


6. The following coins were in circulation in the United States in 1875: the 
Indian-head penny, a bronze two-cent piece (last minted in 1873), a silver 
three-cent piece (also last minted in 1873), a nickel three-cent piece, the 
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shield nickel (worth five cents), the seated liberty half-dime, dime, twenty- 
cent piece (produced for only four years beginning in 1875), quarter, half- 
dollar, and silver dollar, and the Indian-head gold dollar. (We ignore the 
trade dollar, minted for circulation between 1873 and 1878, as it was issued 
for overseas trade. This coin holds the distinction of being the only U.S. 
coin to be demonetized.) 


(a) How many ways were there to make twenty cents in change in 1875? 
How about twenty-five cents? Compute these values using the tabular 
method of this section. 


(b) Write down a generating function in the form of a rational function 
for the number of ways to make k cents in change in 1875, then use 
a computer algebra system to find the number of ways to make one 
dollar in change in 1875. 


. Unspired in part by [133, ex. 7.21].) A ransom note demands: 


(i) $10000 in unmarked fifty- and hundred-dollar bills, and 
(ii) the number of ways to award the cash. 


You realize that both old-fashioned and redesigned anticounterfeit bills are 
available in both denominations. 


(a) Answer the second demand of the ransom note. For extra credit, an- 
swer the first demand ~. 


(b) Find a closed form for the number of ways to make 50m dollars us- 
ing the two kinds of fifty- and hundred-dollar bills, for a nonnegative 
integer m. 


. In 2010, there are six different kinds of nickels in general circulation in 


the U.S., and six different kinds of pennies. Four of the varieties of nickels 
were issued in 2004 and 2005 and commemorate the bicentennial of the 
Lewis and Clark expedition—their respective designs on the reverse show a 
handshake, a boat, a bison, and an ocean view; the other two show president 
Jefferson’s home, Monticello. Four of the pennies were issued in 2009 to 
commemorate the bicentennial of Lincoln’s birth, with each design evoking 
a different period of the life of the U.S. president. 


(a) Determine a generating function in closed form for the number of 
ways a, to make k cents in change using only pennies and nickels 
available in 2010, counting each design as a different coin. 


(b) Determine a finite sequence co, C1, ..., Cn So that 


ae c + ,) 
ay = ee Ck—5j . 


9. 


10. 
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(c) Use the formula to determine a;, and verify that your answer is cor- 
rect. 


(d) Use the formula to determine a23, a@24, and a25. 


In 2010, there are fifty commemorative quarters in general circulation in 
the U.S., one for each state, and sixteen different presidential dollar coins, 
showing Washington through Lincoln on the obverse. Prove that the number 
of ways to make 25k cents in change using just these 66 different coins is 


oo (> *) oe ‘) (°F ‘), 


Then use this formula to determine the number of ways to change one dollar 
using just these coins. 


A hungry math major visits the school’s cafeteria and wants to know the 
number of ways 5; to take k servings of food, including at least one main 
course, an even number (possibly zero) of side vegetables, an odd number 
of rolls, and at least two desserts. The cafeteria’s food can be distinguished 
only in the coarsest way: Every dish is either a main course, a side veg- 
etable, a roll, or a dessert. There is an unlimited supply of each kind of dish 
available. 


(a) Determine a closed form for the generating function }°,, 5; , 


(b) Show that 
+1 KAl 
an (OE). (MEN), 


The quantities | «| and [] are defined on page 153. 


2.6.4 Fibonacci Numbers 


Attention! Attention! Ladies and gentlemen, attention! There is a 
herd of killer rabbits headed this way and we desperately need your 
help! 

— Night of the Lepus 


Hey, shouldn’t that be a colony of killer rabbits? 

Leonardo of Pisa, better known as Fibonacci, proposed the following harey 
problem in 1202. Assume that the rabbit population grows according to the fol- 
lowing rules. 


1. 


2. 


Every pair of adult rabbits produces a pair of baby rabbits, one of each 
gender, every month. 


Baby rabbits become adult rabbits at age one month and produce their first 
offspring at age two months. 
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3. Rabbits are immortal. 


Starting with a single pair of baby rabbits at the start of the first month, how many 
pairs of rabbits are there after k months? 

Let fF, denote this number. In the first month, there is one pair of baby rabbits, 
so Ff = 1. Likewise, Fy = 1, as there is one pair of adult rabbits in the second 
month. In the third month, we have one baby pair and one adult pair, so F3 = 2, 
and in the fourth month, the babies become adults and the adults produce another 
pair of offspring, so there is one pair of babies and two pairs of adults: Fy = 3. 
Continuing in this way, we record the population in the following table. 


k Babypairs Adult pairs F;, 
0 0 0 0 
1 1 0 1 
2 0 1 1 
3 1 il 2 
4 1 2 3 
5 2 3 5 
6 3 5 8 
7 5 8 13 


Notice that the number of pairs of adults in month k equals the total number of 
pairs of rabbits in month & — 1. This is F,_1. Also, the number of pairs of baby 
rabbits in month k equals the number of adult pairs in month & — 1, which is the 
total number of pairs in month k — 2. This is F;,_2. Therefore, 


Fy = Fe-1+ Fe-2, k>2. (2.40) 

This recurrence, together with the initial conditions fy = 0 and F) = 1, deter- 

mines the Fibonacci sequence {Fi} = {0,1,1,2,3,5,8, 13,21, 34,55, 89,...}. 
This sequence appears frequently in combinatorial problems. 

In this section we determine a closed form for F;, by analyzing its generat- 

ing function. We will adapt this technique to solve other recurrences later in this 


chapter. 
Let G() be the generating function for {Fj}. Then 


G(x) = ‘S Fyak 
k>0 


=Fo+Firt > Fyak 
k>2 


=eU> a + Fy_2)a* 


k>2 
=S@TsZ@ ) Pie + x ) ee 
k>2 k>2 


= ctx) Frat +2°S° yak, 


k>1 k>0 
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and so 
G(x) =2+2G(az) + 27G(2). 
Therefore, 
G(a) = ees 
TS +a? 


and thus F;, is the coefficient of x” in the Maclaurin series for this rational func- 
tion. How can we determine this series without all the messy differentiation? 

The trick is using partial fractions to write G(a) as a sum of simpler rational 
functions. Write x? + 2 — 1 = (x + y)(x + ¢), where ¢ is the golden ratio, 
y = (1+ V5)/2, and ¢ = (1 — V5) /2. Write 


So A B 
Sie eee toe 
uw +2 r+p “L+H 


and solve to find that A = —y/\/5 and B = ¢/\/5. Thus 


60) = Js (sS5-a%s) 
ee 
1 i) 1 
-3 (ie -oe) 


since yy = —1. Now the two terms on the right are closed forms for simple 
geometric series, so 


and therefore 


i a (2.41) 
Notice that || < 1, so Fy ~ y*/./5: a large number of rabbits indeed. 


Exercises 


1. In each of the following problems, first compute the value the expression 
for a few small values of n. Then use your data to conjecture a general 
formula. Last, prove that your formula is correct. 


(a) 2 FB 
k=0 


(b) Do Far. 
k=0 
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() D Far-1, ifn > 1. 
k=1 


() FroiFn1 — F?, ifn > 1. 
2. Solve the recurrence ay = 2a,_1 + 3a,~2, if ag = 0 and a; = 8. 


3. Suppose ag = 0, a, = 5, and ap = az_—1 + Gaz_2 for k > 2. Compute 
a closed form for the generating function of the sequence {a;,}. Then use 
this to determine a formula for az. 


4. Solve the recurrence az, = 2az_1 + 2ap_2, if ag = 0 and a; = 1. 
5. Prove the following identities involving Fibonacci numbers. 


(a) Fen = Fn Fait + Fn_i1Fn, ifm > 1landn > 0. 
(b) F2 + F2,, = Fonsi. 
(c) F2,, — F2_, =F, ifn >1. 


() D0 Fe = FrFntt- 
k=0 


@) S\(-1)"*® = F-1- (-1)",ifn > 1 
k=0 


() So (-1)"*kF, = (n+ 1)Fr—1 — Fre — 2(-1)”, ifn > 2. 
k=0 


6. Prove that if m and n are nonnegative integers, then F,,, divides Fi. 


7. The Lucas numbers are defined by Lo = 2, L; = 1, and Ly = Lp_1+Lpr-2 
for k > 2. Find a formula for L;, in terms of vy and @. 


8. Prove the following identities involving Lucas and Fibonacci numbers. 


(a) Ln = £n+1 + Fn-1, if n 2 ile 


(b) ~ Lp = Laleig 23, 
k=0 


(©) S\(-1)*Ln—k = En-1 + 3(-1)”, ifn > 1. 
k=0 


(d) Pon = Eelins 
(€) Ing = 1B? =9(—1)*. 


9. The Perrin sequence is defined by ag 3, a1 0, ag 2, and a, = 
dp—2 + ap—3 for k > 3. The Padovan sequence is defined by bb = 0, 
by = 1, bo = 1, and by = bg_2 + De_3 for k > 3. 
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(a) Find generating functions in the form of rational functions for the 
Perrin sequence and the Padovan sequence. 


(b) Prove that a, = r&+o*%+a*, where r, a, and @ are the three complex 
k 


roots of 7? — « — 1. Conclude that a; ~ r*. 
The Perrin sequence has an interesting property: If p is a prime number, then 
p divides the pth term in the Perrin sequence, p | ap. This was first noted 
by Lucas in 1878 [192-194] (perhaps Lucas would have been interested in 
Exercise 6 of Section 2.6.3). Thus we obtain a test for composite numbers: 
If n does not divide a,,, then n is not prime. Unfortunately, the converse is 
false: There are infinitely many composite n with the property that n | a. 
This was proved by Grantham [137]. 


10. In the children’s game of hopscotch, a player hops across an array of squares 
drawn on the ground, landing on only one foot whenever there is just one 
square at a position, and landing on both feet when there are two. If ev- 
ery position has either one or two squares, how many different hopscotch 
games have exactly n squares? Figure 2.3 shows the five different hop- 
scotch games having four squares. 


FIGURE 2.3. Hopscotch games with four squares. 


11. Use a combinatorial argument and Exercise 10 to prove that 
n—-k-1 
FF, = : 
eae 


2.6.5 Recurrence Relations 


O me! O life!... of the questions of these recurring; 
— Walt Whitman, Leaves of Grass 


In the “Tower of Hanoi” puzzle, one begins with a pyramid of k disks stacked 
around a center pole, with the disks arranged from largest diameter on the bottom 
to smallest diameter on top. There are also two empty poles that can accept disks. 
The object of the puzzle is to move the entire stack of disks to one of the other 
poles, subject to three constraints: 
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1. Only one disk may be moved at a time. 
2. Disks can be placed only on one of the three poles. 
3. A larger disk cannot be placed on a smaller one. 


How many moves are required to move the entire stack of k disks onto another 
pole? Let a; denote this number. Clearly, aj = 1. To move k disks, we must first 
move the k — 1 top disks to one of the other poles, then move the bottom disk to 
the third pole, then move the stack of k — 1 disks to that pole, so a, = 2az-1 +1 
for k > 1. Thus, ag = 3, a3 = 7, a4 = 15, and it appears that a, = QF = 1. 

We can certainly verify this formula by induction, but we wish to show how 
recurrences of this form can be solved by using generating functions. Consider 
the more general recurrence 


ap = baz_-1 + ¢, k>1, 


where b and c are constants. This is a linear recurrence relation, since a, is a linear 
function of the preceding values of the sequence. (The Fibonacci recurrence is 
also a linear recurrence relation.) If cis zero, we call the recurrence homogeneous; 
otherwise, it is inhomogeneous. 

Let G() be the generating function for {a;,}. Then 


G(x) = LS apx® 


k>0 
=agt y (ba, 12" + ex) 
k>1 
=aj+ ba y aga” + cx ) ak 
k>0 k>0 
cx 


= ao + brG(ax) + i= 


and so 
cx ao 


oO) Gepaiae” T= 


Assuming b # 1, we compute 


cx _ ¢ 1 7 1 
(—bz)\l—2) b-1\l-br 1-2)’ 


ate) = (20+ 525) (=) -s5z a) 


= (co 22) otal — al 


k>0 k>0 


sO 
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and therefore 


= Cc k Cc 
u(t pS) Fant (2.42) 


For example, to find the number of moves needed to solve the Tower of Hanoi 
puzzle, we set ag = 0, b = 2, and c = 1| to obtain a, = 2% — 1. Also, if we set 
i= —4 andc = 2, we find that a, = (—1)* (ao — $)/2* + 4, so ag approaches 
3 as k grows large, independent of the initial value ao. 

We conclude with a short list of useful generating functions. Since 


1 
t= ye, (2.43) 


k>0 


we differentiate both sides to find that 


and so 


— = Ska. (2.44) 


Thus we obtain a closed form for the generating function of the identity sequence 
{k}. We take up the problem of determining a generating function for {k”}, for 
any fixed positive integer n, in Section 2.8.5. 

Finally, we integrate both sides of (2.43) to obtain the generating function for 


{1/k}: : 
-In(l-z)=>> e (2.45) 
k>1 
Exercises 


1. Find a recurrence relation for the maximal number of regions of the plane 
separated by &; straight lines, then solve it. 


2. Solve for a;, in terms of ao and the other parameters in each of the following 
recurrence relations. 
(a) Gy = Ap_-1 +6. 
(b) ay = bap_1 + cb". 
(c) ap = bax_1 + cr®, assuming b F r. 
(d) ay = bay_1 + cr*® + d, assuming b ¢ {1,r}. 


(e) a, = bag_1 + ck, assuming b £ 1. 


(f) ap = bax_1 + ck + d, assuming b # 1. 


3. Find a closed form for the generating function of the sequence {k?};,>0. 
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4. Let v, denote the number of ways that 3n different people can split up into 
n three-person teams for a volleyball tournament, and let vp = 1. Assume 
that team members are unordered, so the team {a, b, c} is the same as the 
team {c, a, b}, and assume that the teams are unordered, so putting {a, b, c} 
on the first team and {d, e, f} on the second is the same as putting {d, e, f} 
on the first team and {a, b,c} on the second. Determine a recurrence rela- 
tion for v,, then use it to compute v4. 


5. Let dy, denote the minimal degree of a polynomial with {0, 1} coefficients 
that is divisible by (2 + 1)". For example, certainly d; = 1, since f(x) = 
x +1 has the required properties, and dz < 4, since fo(x) = (2 +1)(23 + 
1) = 24+ 23 +2 +1 is permissible (in fact, dp = 4). 


(a) Determine an upper bound on d3 by multiplying f2(a) by a suitable 
binomial of the form x” + 1, choosing r as small as possible. Then 
iterate this process to obtain upper bounds for d4 and ds. 


(b) Observe that one can obtain an upper bound on d;, in general by con- 
structing a polynomial of the form 


k 
fla) =T] @™ +1) 


i=l 
for a judiciously selected sequence {r;}. Describe how to calculate 
{r;}, and compute the values of this sequence for i < 7. 
(c) Determine a linear, homogeneous recurrence relation for the sequence 
{rif 
(d) Compute a closed formula for rj. 
(e) Determine an upper bound for dx. 
6. A binary sequence is a sequence in which each term is 0 or 1. Determine a 


recurrence relation for the number of binary sequences of length n that do 
not contain two adjacent 1s, then find a simple expression for this number. 


7. Let t,, denote the number of binary sequences of length n that do not con- 
tain three adjacent Is. 


(a) Determine a recurrence relation for t,,, and enough initial values to 
generate the sequence. 
(b) Determine a closed form for the generating function 


T(x) = S- tnx”. 


n>0 


(c) Define t7 by (6 = t] = 0, tf = 1, and t} = ty_-3 forn > 3. 
Determine a closed form for T*(x) = }>,,s, t,x". The numbers {t* } 
are known as the tribonacci numbers. 
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8. For a fixed positive integer m, let s,,, denote the number of binary se- 
quences of length n that do not contain m adjacent 1s. 


(a) Determine a recurrence relation in n for 8, and enough initial val- 
ues to generate the sequence. 


(b) Show that the generating function S,,,(x) for {$n,m}n>0 is 


l=a™ 


Sm(x) = emtl oe 41° 


Hint: First define a sequence Smryn from 5 ,n 1n the same manner as 
Exercise 7c. Then find its generating function S* (a), and use this to 
determine S(x). The numbers {s7, ,,}n>o0 are known as the general- 
ized Fibonacci numbers of order m, or the m-generalized Fibonacci 
numbers. 


2.6.6 Catalan Numbers 


zero, un, dos, tres, quatre, cinc, sis, set, vuit, nou, deu, onze, dotze, 
tretze, catorze, quinze, setze, disset, divuit, dinou, vint. 


How many ways are there to compute a product of k + 1 matrices? Matrix mul- 
tiplication is associative but not commutative, so this is the number of ways to 
place k — 1 pairs of parentheses in the product x7o71 ...2, in such a way that the 
order of multiplications is completely specified. Let Ci, denote this number. 

Let us first compute a few values of C;,. There is only one way to compute 
the product of one or two matrices. There are two ways to group a product of 
three matrices, (%21)a2 and %o(x1 22), and there are five ways for a product of 
four matrices: ((w92'1)@2)X3, (wo(a122))x3, (2921) (xo23), xo((x122)x3), and 
Xo(x1(x2%3)). A bit more work gives us 14 ways to compute a product of five ma- 
trices: There are five ways if one pair of parentheses is xo(71 224324), another five 
for (%9%1%2%3)x4, two for (%oX1)(%2u%3%4), and two more for (79x1X2)(%3L4). 
We record these numbers in the following table. 


k Cy 
0 1 
1 ol 
2 2 
3.C«S8 
4 14 


Can we determine a recurrence relation for C;,? 
Suppose we group the terms so that the last multiplication occurs between 7; 
and 2441: 


(aor1 oo Li) (Lig sae Lr). 
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Then there are C; ways to group the terms in the first part of the product, and 
Cy—1-; ways for the second part, so there are C;C,_1_; ways to group the re- 
maining terms in this case. Summing over 7, we obtain the following formula for 
the total number of ways to group the /: + 1 terms: 


k-1 
Cr = 5, CiCri-i, k2 1. (2.46) 
i=0 


We compute 


Or = CsCo = 1, 
Co = CoC1 + CiCo = 2, 

Cs = O05 2-010, £0) =5; 

C= GC, £0120, £OG = 

Ce = OO Cis 2 Oy OC 4 O.0y = 42. 


We would like to solve this recurrence to find a formula for Cx, so let us define 
the generating function for this sequence, 


G(z) = » Cy". 


k>0 


Unlike other recurrences we have studied, this one is not linear, and has a vari- 
able number of terms. To solve it, we require one fact concerning products of 
generating functions. 
_ k = wk 
If A(x) = dops9 ee” and B(x) = d 7,59 bea", then 


k 
A(x) B(a) = S- (>: wins) gk. 


k>0 \i=0 


Let c, = pee a;D,—;. The sequence {c;,} is called the convolution of the se- 
quences {a;,} and {b;,}. Thus, the generating function of the convolution of two 
sequences is the product of the generating functions of the sequences. 

Using this fact, we find that 


k>0 
k-1 
=M+)>> (x 01-1] x 
k>1 \i=0 
k 
=142>( cic) 
k>0 \i=0 
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since ae C;C;,—:} is the convolution of {C;,} with itself. Thus, 
G(x)? — G(x) +1=0, 


and so 


1+VJ/1—4¢ 
Qu , 
Only one of these functions can be the generating function for {C;,}, and it must 
satisfy 
lim G(r) = Co = 1. 


x2z—0 


It is easy to check that the correct function is 


1—vV1l- 42 


oy) = 20 


We now expand Ga) as a Maclaurin series to find a formula for C;,. Using the 
generalized binomial theorem and the identity for negating the upper index, we 
find that 


k>1 
k-1/2\ , 5 
=1+4 4 
Ds ( k+1 ) 
k>0 
Therefore, 
k-1/2\ , & 
2) = k+1 ) 
k>0 
and so k—1/2 
C g2k+1 _ 
k+1 


We can find a much simpler form for C,. Expanding the generalized binomial 
coefficient and multiplying each term in the product by 2, we compute that 


g2k+1 k 1 
O= eae (#- 3-4) 


oe 
="GaD [[(@«-1- 2%). 
i=0 
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The product consists of all the odd numbers between —1 and 2k — 1, so 


ae 
Cr = yay Le - 


gk (24 — 1)(2) 
- gap ll 2i 


— 
= HeTDI [[@:- De). 


i=l 


The remaining product is simply (2)!, so 


(2k)! 12k 


Cy is called the kth Catalan number. 

Incidentally, since C; is an integer, we have shown that k + 1 always divides 
the binomial coefficient oan Can you find an independent arithmetic proof of 
this fact? 

Sloane and Plouffe [258] remark that the Catalan numbers are perhaps the sec- 
ond most frequently occurring numbers in combinatorics, after the binomial coef- 
ficients. Indeed, Stanley [262, ex. 6.19] lists 66 different combinatorial interpre- 
tations of these numbers! We close with another problem whose solution involves 
the Catalan numbers. 

A rooted tree is a tree with a distinguished vertex called the root. The vertices 
in a rooted tree form a hierarchy, with the root at the highest level, and the level of 
every other vertex determined by its distance from the root. Some familiar terms 
are often used to describe relationships between vertices in a rooted tree: If v and 
w are adjacent vertices and v lies closer to the root than w, then v is the parent of 
w, and w is a child of v. Likewise, one may define siblings, grandparents, cousins, 
and other family relationships in a rooted tree. 

We say that a rooted tree is strictly binary if every parent vertex has exactly two 
children. How many strictly binary trees are there with k parent vertices? Do not 
take symmetry into account: If two trees are mirror images of one another, count 
both configurations. Figure 2.4 shows that there are five trees with three parent 
vertices. 

It is easy to see that the number of strictly binary trees with k parent vertices 
is C,. By Exercise 2, every such tree has k + 1 leaves. Label these vertices with 
xo through x; from left to right in the tree. Then the tree determines an order of 
multiplication for the x;. For example, the five trees in Figure 2.4 correspond to 
the multiplications ((x921)22)x3, (%o(@122))x3, (w0a1)(x2%3), Xo((@142)a3), 
and x%9(x1(x22'3)), respectively. Binary trees like these are often used in computer 
science to designate the order of evaluation of arithmetic expressions. 
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ares 
Paes 


FIGURE 2.4. Strictly binary trees with three parent vertices. 


Exercises 
1. Show that every vertex in a rooted tree has at most one parent. 


2. Show that a strictly binary tree having exactly k parent vertices has exactly 
k + 1 leaves. 


3. A diagonal of a convex polygon is a line segment connecting two non- 
adjacent vertices of the polygon. Let p,, denote the number of ways to de- 
compose a convex polygon having n vertices into triangles by drawing n—3 
diagonals that do not cross inside the polygon. Assume that the vertices of 
the polygon are labeled, so that triangulations with different orientations 
are counted separately. 


(a) Determine p3, p4, ps5, and peg by showing all the possible triangula- 
tions. 


(b) Let v be a fixed vertex of a polygon with n = 7 sides. Count all the 
triangulations of the heptagon by considering two cases: (i) v is not an 
endpoint of any of the four diagonals added in a triangulation, and (ii) 
v is an endpoint of at least one of the diagonals. Use this to determine 
the value of p7 without drawing every possible triangulation. 


(c) Determine a formula for py. 


4. A staircase of size n is a path in the plane from the origin to the point (n, 7) 
consisting of exactly n horizontal and n vertical steps, each of length 1, with 
the added condition that the path never rises above the line y = z. Let s,, 
denote the number of staircases of size n. For example, s; = 1 since the 
only staircase is li Also, s2 = 2 since the only possible staircases are 
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5. 
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etleaede. 


(a) Determine s3 and s4 by drawing all the staircases of these sizes. 


(b) For1 < k < n, let s,,;, denote the number of staircases of size n for 
which the first intersection of the path with the y = x line (after the 
origin) is the point (k,k). Determine a formula for s,,;, in terms of 
the numbers s,,,. 


(c) Determine a recurrence relation for s,,, then a formula. 


Let r,, denote the number of mountain ridgelines one can draw using n 
ascending steps and n descending steps. A ridgeline must start and end on 
the horizon, and may never dip below the horizon. For example, r; = 1 
since the only ridgeline is /\, and the following illustrates a valid ridgeline 
with n = 11. 


/\ 
1A MA 
/ VW VA 


Two ridgelines that are mirror images of one another count as different 
arrangements. 
(a) Determine ro, 73, and r4 by drawing all of the possible ridgelines. 


(b) Use a combinatorial argument to determine a recurrence relation for 
Tn, then find a formula for r,,. 


. Suppose 2k people are seated around a table. How many ways are there for 


the & pairs of people to shake hands simultaneously across the table in such 
a way that no arms cross? 


. Show that the coefficient of «* in the Maclaurin series expansion of (1 — 


(1 — 32)/3) /z is 


1 k 
CZs He ashe 


. Use an arithmetic argument to show that (2k)! is divisible by k!(A + 1)!. 


Hint: First compute the number of times a prime number p divides m!. 


Polya’s Theory of Counting 


Who are you who are so wise in the ways of science? 
— Sir Bedivere, in Monty Python and the Holy Grail 
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How many ways can King Arthur and his knights sit at the round table? How 
many different necklaces with n beads can be formed using m different kinds 
of beads? 

Both these questions ask for a number of combinations in the presence of sym- 
metry. Since there is no distinguished position at a round table, seating Arthur 
first, then Gawain, Percival, Bedivere, Tristram, and Galahad clockwise around 
the table yields the same configuration as seating Tristram first, then Galahad, 
Arthur, Gawain, Percival, and Bedivere in clockwise order. Similarly, we should 
consider two necklaces to be identical if we can transform one into the other by 
rotating the necklace or by turning it over. 

Before answering these questions, let us first rephrase them in the language of 
group theory. 


2.7.1 Permutation Groups 


T haven’t fought just one person in a long time. I’ve been 
specializing in groups. 
— Fezzik, in The Princess Bride 


A group consists of a set G together with a binary operator o defined on this set. 
The set and the operator must satisfy four properties. 


e Closure. For every a and bin G,aobisinG. 
e Associativity. For every a, b, andcin G,ao (boc) = (aob)oe. 


e Identity. There exists an element e in G that satisfies eo a = aoe = a for 
every a in G. The element ¢ is called the identity of G. 


e Inverses. For every element a in G, there exists an element b in G such that 
aob=boa=e. The element b is called the inverse of a. 


In addition, if ao b = boa for every a and b in G, we say that G is an abelian, or 
commutative, group. 

For example, the set of integers forms a group under addition. The identity 
element is 0, since 0 + 7 = 7+ 0 = 27 for every integer 2, and the inverse of the 
integer 7 is the integer —7. Similarly, the set of nonzero rational numbers forms a 
group under multiplication (with identity element 1), as does the set of nonzero 
real numbers. 

We can also construct groups of permutations. A permutation of n objects may 
be described by a function 7 defined on the set {1,2,...,n} by ordering the 
objects in some fashion, then taking (7) = j if the ith object in the order- 
ing occupies the 7th position in the permutation. For example, the permutation 
(c, d, a, e, b] of the list [a, b, c, d, e] is represented by the function 7 defined on the 
set {1,2,3,4,5}, with 7(1) = 3, 7(2) = 5, 7(3) = 1, 7(4) = 2, and 7(5) = 4. 
Notice that a function 7 : {1,...,n} — {1,...,n} arising from a permutation 
has the property that 7(i) 4 2(j) whenever i # 7. Such a function is called an 
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injective, or one-to-one, function. The map 7 also has the property that for every 
m with 1 < m <n, there exists a number i such that 7(i) = m. A function 
like this that maps to every element in its range is called surjective, or onto, and a 
function that is both injective and surjective is said to be a bijection. Thus, every 
permutation of n objects corresponds to a bijection 7 on the set {1,2,...n}, and 
every such bijection corresponds to a permutation. 

Let S,, denote the set of all bijections on the set {1,2,...,}. Exercise 4 asks 
you to verify that this set forms a group under the operation of composition of 
functions. For example, the identity element of the group is the identity map 7, 
defined by 7o(k) = k for each k, since 7 0 79 = 79 0 7 = 7 for every z in S,,. 
This group is called the symmetric group on n elements. 

The size of the group S', is the number of permutations of n objects, so |S;,,| = 
n!. Because of our correspondence, we normally refer to an element of S;, as a 
permutation, rather than a bijection. 

To specify a particular permutation z in S,,, we need to name the value of 7(k) 
for each k. This is often written in two rows as follows: 


© (2) (3) ~ nae 


123 4 5 
3.5 1 2 4 
denotes the permutation described earlier. 
We can describe the permutations in a more succinct manner by using cycle 
notation. For example, in the permutation above, 7 sends | to 3 and 3 to 1, and 


sends 2 to 5, 5 to 4, and 4 to 2. So we can think of 7 as a combination of two 
cycles, 1 — 3 — 1 and 2 — 5 — 4 — 2, and denote it by 


For example, 


(13) (254). 


Of course, we could also denote this same permutation by the cycles (542)(31), 
so to make our notation unique, we make two demands. First, the cycle contain- 
ing 1 must appear first, followed by the cycle containing the smallest number 
not appearing in the first cycle, and so on. Second, we require the first number 
listed in each cycle to be the smallest number appearing in that cycle. To simplify 
the notation, cycles of length 1 are usually omitted, so (1253) (4) is written more 
simply as (1253). The identity permutation is denoted by (1). 

The composition of two permutations is computed from right to left. For exam- 
ple, suppose 7; = (13)(254) and m2 = (15423). We determine the composition 
71 © 72 by applying 72 first, then 71. Since 7g sends 1 to 5, and 7 sends 5 to 4, 
the composition 71 © 7 then sends 1 to 4. In the same way, we see that 71 0 72 
sends 4 to 5, 5 to 2, 2 to 1, and 3 to 3. Thus, 71 0 72 = (1452). In cycle notation, 
we denote the composition of two permutations by juxtaposing their cycles, so 


m 0 Mg = (13) (254) (15423) = (1452). 
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Notice that the cycles for 7; appear first, so products of cycles are always com- 
puted from right to left. Also, we calculate that 72 0 7, = (15423) (13)(254) = 
(2435), so in general S,, is not an abelian group. 

A subset H of a group G is called a subgroup of G if H is itself a group under 
the same binary operation. The group S,, contains many subgroups; for example, 
{(1), (12)} is a subgroup of S', for every n > 2. We investigate three particularly 
important subgroups of S;,. 


The Cyclic Group 
If 7 is a permutation in S, and m is a nonnegative integer, let 7” denote the 
permutation obtained by composing 7 with itself m times, so 7°? = (1), and 
tw =TonromT. Let 

(tT) ={n™ : m> Of, (2.48) 


so that (7) is a subset of S,,. In fact (Exercise 5), (7) is a subgroup of S,,, and we 
call this group the cyclic subgroup generated by z in S,. 

The cyclic group C’, is the subgroup of the symmetric group S;,, generated by 
the permutation (123---n), so 


Ca = (123+++n)). (2.49) 


Clearly, C;, contains n elements, since n applications of the generating permuta- 
tion are required to return to the identity permutation. For example, (1234)? = 
(13)(24), (1234)? = (1432), and (1234)* = (1), so 


C4 = {(1), (1234), (13)(24), (1432)}. (2.50) 


The group C;, may be realized as the group of rotational symmetries of a reg- 
ular polygon having n sides. For example, each of the permutations of (2.50) 
corresponds to a permutation of the vertices of Figure 2.5 obtained by rotating the 
square by 0, 90, 180, or 270 degrees. 


The Dihedral Group 


The dihedral group D,, is the group of symmetries of a regular polygon with 
n sides, including reflections as well as rotations. Since C’;, consists of just the 
rotational symmetries of such a figure, evidently C,, is a subgroup of Dy. 

Referring to Figure 2.5, we see that D4 consists of the four rotations of C4, 
plus the four reflections (12) (34), (14)(23), (13), and (24). The first two permu- 
tations represent reflections about the vertical and horizontal axes of symmetry 
of the square; the last two represent flips about the diagonal axes of symmetry. 
In general, if n is even, we obtain n/2 reflections through axes of symmetry that 
pass through opposite vertices, and n/2 reflections through axes that pass through 
midpoints of opposite edges. Combining these with the n rotations of C;,, we find 
that |D,,| = 2n in this case. 
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FIGURE 2.5. A square with labeled vertices. 


1 


FIGURE 2.6. A regular pentagon with labeled vertices. 


Using Figure 2.6, we find that Ds consists of five rotations and five reflections, 


Ds = {(1), (12345), (13524), (14253), (15432), (25)(34), 
(13)(45), (15)(24), (12)(35), (14)(23)}. 


It is easy to see that we always obtain n reflections if n is odd, so |D,| = 2n for 
everyn > 1. 


The Alternating Group 


Every permutation can be expressed as a product of transpositions, which are 
cycles of length 2. For example, the cycle (123) can be written as the product 
(12)(23), and the permutation (1234) (567) can be expressed as the product of six 
transpositions: (12)(23)(34)(56)(67). Such a decomposition is not unique; for 
instance, (123) may also be written as (23)(13), or (12)(23)(13) (13). However, 
the number of transpositions in any representation of one permutation is either 
always an even number, or always an odd number. Exercise 6 outlines a proof of 
this fact. If a permutation 7 always decomposes into an even number of transpo- 
sitions, we say that 7 is an even permutation; otherwise, it is an odd permutation. 
Notice that the identity permutation is even, since it is represented by a product 
of zero transpositions. 
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The alternating group A,, consists of the even permutations of S,,. For exam- 
ple, Az = {(1), (123), (132)} = C3, and 
Aa = {(1), (123), (182), (124), (142), (184), (143), 
(234), (243), (12)(34), (13)(24), (14)(23)}. 


Exercises 7 and 8 ask you to verify that A,, is a group of size |A,,| = n!/2 for 
n > 2, and that A,, is not abelian for n > 5. 


Exercises 
1. Show that the identity element of a group is unique. 
2. (a) Suppose that M is a finite set and f : MM — WM is an injective func- 
tion. Show that f is a bijection. 


(b) Suppose that V isa finite set and f : J — M is asurjective function. 
Show that f is a bijection. 


(c) Show that neither of these statements is necessarily true if M is an 
infinite set. 


3. In each part, determine all values of n that satisfy the statement. 


(a) C, is a subgroup of Ay. 
(b) D,, is a subgroup of A,,. 
(c) Cy, is a subgroup of D,,+1. 
(d) C, is a subgroup of S),41. 


4. Verify that S;, forms a group under composition of functions by checking 
that each of the required properties is satisfied. 


(a) Closure. If 7 and 72 are bijections on {1,2,...,}, show that 71 072 
is also a bijection on {1,2,...,n}. 


(b) Associativity. If 71, 72, and 73 are in S',, show that 71 0 (72 0773) and 
(71 © 7) © 73 represent the same function in S,,. 


(c) Identity. Check that 7) o 7 = 170 79 = 7, for every 7 in S;,. Here, 719 
is the identity map on {1,2,...,n}. 


(d) Inverses. Given a bijection 7 in S,,, construct a bijection a" in Si, 


satisfying tom~!=a-lonr=Tp. 


5. Suppose that G is a group and g is an element of G. 


(a) Show that (g) is a subgroup of G. 
(b) Show that (g) is abelian. 
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6. Let x denote the vector of n variables (21, 72,..., 2»). Define 
Paj= || Ga), 
l<i<j<n 


and if 7 € Sy, let 


P,(x)= |] (xq — mq) 


l<i<j<n 


(a) Show that P,(x) = +P(x). 
(b) Show that P, (x) = —P(x) if 7 is a transposition. 


(c) Conclude that no permutation 7 in S;, can be represented both as a 
product of an even number of transpositions and as a product of an 
odd number of transpositions. 


7. (a) Prove that A, is a group. 
(b) Show that A,, is not abelian for n > 5. 


8. Let n > 2, let B,, denote the set of odd permutations in S,,, and let 7 be a 
transposition in S;,. 


(a) Show that the map T : S, — Sj), defined by T(x) = To7misa 
bijection. 

(b) Show that T maps A,, to B,, and B, to An. 

(c) Conclude that |A,,| = n!/2. 


9. Determine the group of symmetries of each of the following objects. 


(a) The vertices of a regular tetrahedron. 
(b) The vertices of a cube. 


(c) The vertices of a regular octahedron. 


2.7.2. Burnside’s Lemma 


Burnside had submitted the scheme to Meade and myself, and we 
both approved of it, as a means of keeping the men occupied. 
— Personal Memoirs of U. S. Grant 


Armed with our knowledge of permutation groups, we now develop a general 
method for counting combinations in the presence of symmetry. In general, we 
are given a set of objects S, a set of colorings of these objects C’, and a group 
of permutations G representing symmetries possessed by configurations of the 
objects. We consider two colorings in C’ to be equivalent if one of the permutations 
in G transforms one coloring to the other, and we would like to determine the 
number of nonequivalent colorings in C. 
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For example, suppose S = {1,2,3,4} is the set of vertices of the square in 
Figure 2.5, and C is the set of all possible colorings of these vertices using two 
colors, red and green. Let rrgr denote the coloring where vertices 1, 2, and 4 are 
red and vertex 3 is green. Then 


C= {9999, 9997, 9979, 99TT, 9799, 9TGT, GTTG, grrr, 


(2.51) 
Trggg; rggr, rgrg, rgrr, Trgg, rrgr, TTT g, rrrvr}. 


We consider two colorings in C’ to be equivalent if one can be transformed to the 
other by a rotation of the square. For example, rotating the coloring rrgr yields 
the equivalent colorings rrrg, grrr, and rgrr. So we choose G to be the group of 
rotations, C4. A permutation 7 in C4 is a function defined on the set {1, 2,3, 4}, 
but 7 induces a map 7* defined on the set of colorings C’ in a natural way. For 
example, if 7 is the 180-degree rotation (13) (24), then the induced map 7™* rotates 
a coloring by the same amount, so m*(rrgr) = grrr, and m*(grgr) = grgr. 

If c; and cg are two equivalent colorings in C’, so 7*(c1) = cp for some m7 € G, 
we write c; ~ C2. Using the fact that G' is a group, it is easy to verify (Exercise 1) 
that the relation ~ on the set of colorings is 


e reflexive: c ~ ¢ for all colorings c, 
e symmetric: c, ~ c2 implies cz ~ c,, and 
e transitive: c, ~ cg and co ~ c3 implies c, ~ C3. 


A relation possessing these three properties is called an equivalence relation. By 
grouping together collections of mutually equivalent elements, an equivalence 
relation on a set partitions the set into a number of disjoint subsets, called equiva- 
lence classes. Our goal then is to determine the number of equivalence classes of 
C under the relation ~. 

In our example, the group C4 partitions our set of colorings (2.51) into six 
equivalence classes: 


{9999}, 

{g9g9", 99T9, 9T99; rggg}, 
{ggrr, grrg, rggr, rrgg}, 
{grgr, rgrg}, 
{grrr, rgrr, rrgr, rrrg}, 


{rrrr}. 


Therefore, there are just six ways to color the vertices of a square using two colors, 
after discounting rotational symmetries. 

We can now translate the problems from the introduction to this section into 
this more abstract setting. In the round table problem, S is the set of n places at 
the table, G is C,,, and C is the collection of the n! seating assignments. In the 
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necklace problem, S is the set of n bead positions, G is D,,, and C is the collection 
of the m” possible arrangements of the m kinds of beads on the necklace. 

Before presenting a general method to solve problems like these, we introduce 
three sets that will be useful in our analysis. Given a permutation 7 in G, define 
C,, to be the set of colorings that are invariant under action by the induced map 
7G 

C,={ceEC: r(c)=c}. (2.52) 

This set is called the invariant set of 7 in C. Similarly, given a coloring c in C, 
define G, to be the set of permutations 7 in G' for which c is a fixed coloring, 


G.={mEG: n*(c) =c}. (2.53) 


This set is called the stabilizer of cin G. It is always a subgroup of G. Finally, let 
C be the set of colorings in C that are equivalent to c under the action of the group 
G, 
é={rn(c) : rE Gh. (2.54) 
The set ¢ is thus the equivalence class of c under the relation ~. It is also called 
the orbit of c under the action of G. 
For example, if C' is given by (2.51) and G is the dihedral group D4, we have 


ggg = {999", 9979, 9°99, TIGGS 


and 

Goggr = {(1), (13)}. 
Also, 

grgr = {grgr, rgrg} 
and 


Gorgr = {(1), (13)(24), (13), (24)}. 
Notice that in both cases, the product of the size of the stabilizer of a coloring 
with the size of the equivalence class of the same coloring equals the number of 
elements in the group. The following lemma proves that this is always the case. 


Lemma 2.8. Suppose a group G acts ona set of colorings C. For any coloring c 
in C, we have |G,| |t| = |G]. 


Proof. We prove this by showing that every permutation in G' may be represented 
in a unique way as a composition of a permutation in G, with a permutation 


in a particular set P, where |P| = |@|. Suppose there are m colorings in the 
equivalence class of c, € = {c1,C2,.--,Cm}. For each i between 1 and m, select 
a permutation 7; € G such that 7*(c) = c;, and let P = {77, 72,..., 7m}. 


Now let 7 be an arbitrary permutation in G. Then m*(c) = c; for some i, so 
n*(c) = m¥(c). Thus (1; ‘0 7)*(c) = c, and so 1; | om € Ge. Since 7; 0 (m0 
1) = 7, we see that 7 has at least one representation in the desired form. Suppose 
now that 7 = 7; 0 0 = 7; 0T, for some 7; and 77; in P and some o and 7 in Ge. 
Then 7;(o(c)) = mi(c) = ¢; and 7;(T(c)) = cj, so c; = cj, and hence i = 7. 


Therefore, o = T, so the representation of 7 is unique. O 
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The following formula for the number of equivalence classes of C’ under the 
action of a group G is usually named for Burnside (the English mathematician, 
not the American Civil War general), as it was popularized by his book [45]. This 
result was first proved by Frobenius [115], however, and Burnside even attributes 
the formula to Frobenius in the first edition of his textbook [45]. Further details 
on the history of this result appear in Neumann [213] and Wright [288]. 

Briefly, Burnside’s Lemma states that the number of equivalence classes of 
colorings is the average size of the invariant sets. 


Theorem 2.9 (Burnside’s Lemma). The number of equivalence classes N of the 
set C in the presence of the group of symmetries G' is given by 


1 
N=— Ga| 2.55 
a >= Ce! (2.55) 


TEG 


Proof. If P is a logical expression, let [P] be 1 if P is true and 0 if P is false. 
Then 


1 1 * 
ql ALU eo=4 


TEG cEC 


1 
=godvk@=4 


c€EC nrEG 


We applied Lemma 2.8 to obtain the fourth line. O 


We may apply Burnside’s Lemma to solve the problems we described earlier. 
In the round table problem, |G] = n. The invariant set of the identity permutation 
is the entire set of colorings, Ca) = C, and the invariant set of any nontrivial 
rotation 7 is empty, C,; = { }. Therefore, the number of nonequivalent seating 
arrangements is |C'| /n = (n — 1)!. 

To determine the number of nonequivalent necklaces with four beads using 
two different kinds of beads, we calculate |Cay| = 16, Cas)| = |Ci24)| =, 
|Coa2ysa)| = |Casyaa| = [Cassy] = 4 and [Cars| = |Cossa)| = 2. 
Therefore, VN = (16+2-84+3-4+2-2)/8 = 6. Last, we calculate the number 
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of nonequivalent three-bead necklaces using three different kinds of beads. Here, 
[Cay] = 27, |Caz)| = |Cas)| = |Ces)| = 9, and |Ca2s)| = |Casa)| = 3; s0 
N = 60/6 = 10. 


Exercises 


1. Show that ~ is an equivalence relation on C. 
2. Prove that G, is a subgroup of G. 


3. How many different necklaces having five beads can be formed using three 
different kinds of beads if we discount: 


(a) Both flips and rotations? 
(b) Rotations only? 
(c) Just one flip? 


4. The commander of a space cruiser wishes to post four sentry ships arrayed 
around the cruiser at the vertices of a tetrahedron for defensive purposes, 
since an attack can come from any direction. 


(a) How many ways are there to deploy the ships if there are two different 
kinds of sentry ships available, and we discount all symmetries of the 
tetrahedral formation? 


(b) How many ways are there if there are three different kinds of sentry 
ships available? 


5. (a) How many ways are there to label the faces of a cube with the num- 
bers 1 through 6 if each number may be used more than once? 


(b) What if each number may only be used once? 


2.7.3 The Cycle Index 


Lance Armstrong (7), Jacques Anquetil (5), Bernard Hinault (5), 
Miguel Indurain (5), Eddy Merckx (5), Louision Bobet (3), Greg 
LeMond (3), Philippe Thys (3). 

— Multiple Tour de France winners 


To use Burnside’s Lemma to count the number of equivalence classes of a set of 
colorings C’, we must compute the size of the invariant set C’, associated with 
every permutation 7 in a group of symmetries G. A simple observation allows us 
to compute the size of this set easily in many situations. 

Suppose we wish to determine the number of ways to color n objects using 
up to m colors, discounting symmetries on the objects described by a group G. 
If a coloring is invariant under the action of a permutation 7 in G, then every 
object permuted by one cycle of 7 must have the same color. Therefore, if 7 
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has & disjoint cycles, the number of colorings invariant under the action of 7 is 
IC;| = m*. For example, if S is the set of vertices of a square and G = D4, then 
|Coa2sa)| =m, |Caay(sa)| = m”, |Casyayay| = m°, and [Cay(ay(ay(ay| = m*. 
Notice that it is essential to include the cycles of length 1 in these calculations. 
With this in mind, we define the cycle index of a group G of permutations on 
n objects. For a permutation 7 in G, define a monomial /, associated with 7 in 
the following way. If 7 is a product of k cycles, and the 7th cycle has length £;, let 


k 
My = M(t 29, 0<<4%n) = [| ay: (2.56) 
i=l 
Here, £1, 2, ..., Lp are indeterminates. The cycle index of G is defined by 
1 
Pel) = S” M,(x),; (2.57) 
| | TEG 
where x denotes the vector (a1, 72,...,2n). 


For example, for G = D4, we find that 


Mayc2y(a)(4) = @1 
Masaya) = Mayeayay = 2122, 
Maz)34) = Masy2a) = Maayas) = 23, 
M234) = M1432) = £4- 


Therefore, 
Pp,(@1,%2,03,U4) = + (x4 + Qxtxe + 3x2 + 2x4) ; (2.58) 


and 
Po,(@1, £2, 3, XA) = + (x4 1 ne + 2x4) . (2.59) 


By Burnside’s Lemma, the number of ways to color n objects using up to m 
colors, discounting the symmetries of G, is Pg(m,m,...,m). For example, the 
number of equivalence classes of four-bead necklaces composed using m different 
kinds of beads is 


Pp, (m,m,m,m) = § (m* + 2m3 + 3m? + 2m). 


Substituting m = 2, we find there are six different colorings, as before. 

Finally, let us compute the number of twenty-bead necklaces composed of 
rhodonite, rose quartz, and lapis lazuli beads. We must determine the cycle in- 
dex for the group Dao. We find that eight of the rotations, those by 18k degrees 
with k = 1, 3, 7, 9, 11, 13, 17, or 19, are a single cycle of length 20, yielding 
the term 82x29 in the cycle index. Four rotations, k = 2, 6, 14, and 18, make two 
cycles of length 10, contributing 427). Rotations with k = 4, 8, 12, or 16 make 
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four cycles of length 5, adding Agk, and k = 5 or 15 contributes 23. The ro- 
tation with k = 10 yields 73°, and the identity adds 7°. Ten of the reflections, 
the ones about axes of symmetry that pass through midpoints of edges, are each 
represented by ten transpositions, contributing 107°. The other ten reflections, 
flipping about opposite vertices, yield 102723. Therefore, 


Prog (1, ---, 020) = Fy (7° + 1OxZa} + 112d + 20} + 48 + 425 + 82x20) , 


(2.60) 


and the number of different twenty-bead necklaces that can be made using three 
kinds of beads is Pp, (3,...,3) = 87 230 157. 


Exercises 


1. 


Show that the monomial /,, defined in (2.57) has the property that the sum 
a fj =n. 

(a) Determine the cycle index for S4 and for A4. 

(b) Show that Ps, (m,m,m,m) may be written as a binomial coefficient. 


(c) Determine the smallest value of m for which P4,(m,m,m,m) > 
Ps,(m,m,m,m). 


. Determine the number of different necklaces with 21 beads that can be 


made using four kinds of beads. Your equivalence classes should account 
for both rotations and flips. 


. Determine the number of eight-bead necklaces that can be made using red, 


green, blue, and white beads under each of the following groups of symme- 
tries. 


(a) Ds. 


(b) A subgroup of Dg having four elements. How does the answer depend 
on the subgroup you choose? 


. Determine the cycle index for the group of symmetries of the faces of a 


cube, and use this to determine the number of different six-sided dice that 
can be manufactured using m different labels for the faces of the dice. As- 
sume that each label may be used any number of times. 


2.7.4 Polya’s Enumeration Formula 


T have yet to see any problem, however complicated, which, when 
looked at in the right way, did not become still more complicated. 
— Poul Anderson 


We can use the cycle index to solve more complicated problems on arrangements 
in the presence of symmetry. Suppose we need to determine the number of equiv- 
alence classes of colorings of n objects using the m colors y1, y2, ..., Ym, Where 
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each color y; occurs a prescribed number of times. For example, how many dif- 
ferent necklaces can be made using exactly two rhodonite, nine rose quartz, and 
nine lapis lazuli beads? 

Let us define the pattern inventory of the different ways to color n objects 
using m colors with respect to a symmetry group G as a generating function in m 
variables, 


Fo(yi,Ya)+.+59m) = > avy ae? kr, (2.61) 
Vv 
where the sum runs over all vectors v = (n1, 22,..., Mm) of nonnegative integers 


satisfying n1 +n2+---+Nm = n, and ay represents the number of nonequiv- 
alent colorings of the n objects where the color y; occurs precisely n; times. For 
example, if we denote a rhodonite bead by r, a rose quartz bead by gq, and a lapis 
lazuli bead by /, we see that the answer to our question above is the coefficient of 
r7q°l° in the generating function 


Fp.o(7,¢,!) = > Q(i,5,k) rigil®. 
i+j+k=20 
i,j,k>0 

In his influential paper [224] (translated into English by Read [226]), Polya 
found that the cycle index can be used to compute the pattern inventory in a 
simple way. Recall that each occurrence of x, in the cycle index arises from a 
permutation having a cycle of length k, and if a coloring is invariant under this 
permutation, then these & elements must have the same color. So either each of 
the k objects permuted by this cycle has color y1, or each one has color yg, etc. 
In the spirit of generating functions, this choice can be represented by the formal 
sum y* + y& +---+ y*,. Pélya found that substituting this expression for x; for 
each & in the cycle index yields the pattern inventory for the coloring. 


Theorem 2.10 (Pélya’s Enumeration Formula). Suppose S is a set of n objects 
and G is a subgroup of the symmetric group S',. Let Pg(x) be the cycle index 
of G. Then the pattern inventory for the nonequivalent colorings of S' under the 
action of G using colors Y1, Y2, .--, Ym is 


Faly) = Po (> Bis D Yas von >) ; (2.62) 
i=1 i=1 i=1 


The proof we present follows Stanley [262, sec. 7.24]. 


Proof. Let v = (ni, 2,.--,%m) bea vector of nonnegative integers of length m 
whose components sum to n, and let Cy, denote the set of colorings of S where 
exactly n; of the objects have the color y;, for each 7. Let Cy, denote the invariant 
set of Cy under the action of a permutation 7. 

If a permutation z in G' does not disturb a particular coloring, then every ob- 
ject permuted by one cycle of 7 must have the same color. Therefore, |Cy,| is 
the coefficient of yfty5?---y?™ in Mz (d> yi, >> y?,---,  y”), where M, is 
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n1,,n2 


the monomial defined by (2.56). Let yY denote the term y;*y5? ---yym. Then, 
summing over all permissible vectors v, we obtain 


m m m 
Srl = (Sox Sass ay ; 
v i=1 w=1 i=1 


Now we sum both expressions over all 7 € G and divide by |G. On the left side, 
we have 


ENIo-E (BE lon) 


TEG V v TEG 
= diay" 
by Burnside’s Lemma, and this is the pattern inventory (2.61). On the right side, 
using (2.57), we obtain (2.62), the cycle index of G evaluated at x, = 7; y?: 


m 


Fe(y) = a S> M, (Sox dv bess Sx) 


TEG 


m 


= Pe (Seam Yosh on Sout. O 
w=1 


i=1 i=l 
For example, the pattern inventory for nonequivalent four-bead necklaces under 
D4 using colors red (r), green (g), and blue (0) is 
Fp,(r,9,6) = Pp, (r+g+6, +g? +b, +g? +d%, r+ g%+b*) 
=rttgtt+bt+r3gtrg?+r2b+ rb? + 93d + gb? 
+ 2r?g? + 2r?b? + 2976? + 2r7 gb + 2rg7b + 2rgb*. 


The pattern inventory for nonequivalent four-bead necklaces under C4, using the 
same three colors is 


Fo,(r,9,6) = Po, (r +g +, 7? +97 +07, r?+ 9° +d*, r*+9* +4) 
=ritgt+bt4+r3gtrg?+r2b+ rb? + g°b4+ gb? 
+ 2r2g? + 2r?b? + 297b? + 3r2gb + 3rg7b + 3rgb?. 


Notice that there are three nonequivalent necklaces with two red beads, one green 
bead, and one blue bead under C4, but only two under D4. Can you explain this? 

Using (2.60) and Theorem 2.10, we may compute the pattern inventory for 
twenty-bead necklaces composed of rhodonite (1), rose quartz (q), and lapis lazuli 
(1) beads. This pattern inventory is shown in Figure 2.7, where we see that there 
are exactly 231 260 different necklaces with two rhodonite, nine rose quartz, and 
nine lapis lazuli beads. 
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FIGURE 2.7. Pattern inventory for necklaces with twenty beads formed using three kinds 


of beads. 
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Pélya’s enumeration formula has many applications in several fields, including 
chemistry, physics, and computer science. Pélya devotes a large portion of his 
paper [224] to applications involving enumeration of graphs, trees, and chemical 
isomers. 


Exercises 


1. What is the pattern inventory for coloring n objects using the m colors y1, 
Y2,--+» Ym if the group of symmetries is S,,? 


2. Use Pélya’s enumeration formula to determine the number of six-sided dice 
that can be manufactured if each of three different labels must be placed on 
two of the faces. 


3. The hydrocarbon benzene has six carbon atoms arranged at the vertices of a 
regular hexagon, and six hydrogen atoms, with one bonded to each carbon 
atom. Two molecules are said to be isomers if they are composed of the 
same number and types of atoms, but have different structure. 


(a) Show that exactly three isomers (ortho-dichlorobenzene, meta-dichlo- 
robenzene, and para-dichlorobenzene) may be constructed by replac- 
ing two of the hydrogen atoms of benzene with chlorine atoms. 


(b) How many isomers may be obtained by replacing two of the hydrogen 
atoms with chlorine atoms, and two others with bromine atoms? 


4. The hydrocarbon naphthalene has ten carbon atoms arranged in a double 
hexagon as in Figure 2.8, and eight hydrogen atoms attached at each of the 
positions labeled 1 through 8. 


FIGURE 2.8. Naphthalene. 


(a) Naphthol is obtained by replacing one of the hydrogen atoms of naph- 
thalene with a hydroxyl group (OH). How many isomers of naphthol 
are there? 


(b) Tetramethylnaphthalene is obtained by replacing four of the hydrogen 
atoms of naphthalene with methyl groups (CH3). How many isomers 
of tetramethylnaphthalene are there? 
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(c) How many isomers may be constructed by replacing three of the hy- 
drogen molecules of naphthalene with hydroxyl] groups, and another 
three with methyl groups? 


(d) How many isomers may be constructed by replacing two of the hydro- 
gen molecules of naphthalene with hydroxyl groups, two with methyl 
groups, and two with carboxyl groups (COOH)? 


5. The hydrocarbon anthracene has fourteen carbon atoms arranged in a triple 
hexagon as in Figure 2.9, with ten hydrogen atoms bonded at the numbered 
positions. 


6 3 


FIGURE 2.9. Anthracene. 


(a) How many isomers of trimethylanthracene can be formed by replac- 
ing three hydrogen atoms with methyl groups? 


(b) How many isomers can be formed by replacing four of the hydrogen 
atoms with chlorine, and two others with hydroxyl groups? 


6. The molecule triphenylamine has three rings of six carbon atoms attached 
to a central nitrogen atom, as in Figure 2.10, and fifteen hydrogen atoms, 
with one attached to each carbon atom except the three carbons attached to 
the central nitrogen atom. 


FIGURE 2.10. Triphenylamine. 


(a) How many isomers can be formed by replacing six hydrogen atoms 
with hydroxyl groups? 
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(b) How many isomers can be formed by replacing five hydrogen atoms 
with methyl groups, and five with fluorine atoms? 


7. The hydrocarbon tetraphenylmethane consists of four rings of six carbon 
atoms, each bonded to a central carbon atom, as in Figure 2.11, together 
with twenty hydrogen atoms, with one hydrogen atom attached to each car- 
bon atom in the rings except for those attached to the carbon at the center. 


FIGURE 2.11. Tetraphenylmethane. 


(a) How many isomers can be formed by replacing five hydrogen atoms 
of tetraphenylmethane with chlorine? 


(b) How many isomers can be formed by replacing five hydrogen atoms 
with bromine, and six others with hydroxyl groups? 


8. Suppose a medical relief agency plans to design a symbol for their organi- 
zation in the shape of a regular cross, as in Figure 2.12. To symbolize the 
purpose of the organization and emphasize its international constituency, its 
board of directors decides that the cross should be white in color, with each 
of the twelve line segments outlining the cross colored red, green, blue, or 
yellow, with an equal number of lines of each color. If we discount rotations 
and flips, how many different ways are there to design the symbol? 


FIGURE 2.12. Symbol of a relief agency. 
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2.7.5 de Bruijn’s Generalization 


It doesn’t matter what color, well that gets a nope! 
Be it pink, purple, or heliotrope! 
— Boundin’, Pixar Films 


Suppose a jewelry company plans to market a new line of unisex bracelets under 
the brand name OPPOSITES ATTRACT. The bracelets are sold in pairs, for a couple 
to share. Each bracelet consists of n beads, some gold and some silver, and the 
two bracelets in a pair are opposites, in the sense that one can be obtained from 
the other by changing each silver bead to a gold one and each gold to a silver. For 
example, if one bracelet has two adjacent gold beads and n — 2 silver beads, then 
its mate has two adjacent silver beads and n — 2 gold beads. The companion then 
of the all-gold bracelet is the all-silver one. How many different pairs of n-bead 
bracelets are possible in the OPPOSITES ATTRACT product line? 

We have seen that there are exactly six different bracelets for the case n = 4, if 
we discount both rotations and flips. These are represented by the configurations 
9999; 9998, 9988, 98g8, gsss, and ssss of gold and silver beads. This produces 
just four different (unordered) pairs of bracelets for the product line when n = 4: 


gg9g9g9 + 8888, gggs + 9888, ggs88 + ggss, gsgs + gsgs. (2.63) 


Recall that each of the configurations we listed for n = 4 in fact represent an 
equivalence class of the set of two-colorings of the vertices of a square, where 
we consider two colorings to be equivalent if one can be obtained from the other 
by the action of some element of the group of symmetries of the square, D4. 
In the same way, we may consider each of the pairs of bracelets in our product 
line as representing a single set of two-colorings of the square—the union of the 
equivalence classes of the two bracelets in the set. For example, the four pairs 
listed in (2.63) correspond to the following partition of the sixteen ways to color 
the vertices of a square using at most two colors: 


{9999, 888s}, 

{9995, 9989; 9899, 8999, 8859, 8898, 5958, g8ss}, 
{9988, 9889, 8899, 8995}, 

{9s9s, sgsg}. 


This partition is precisely the collection of equivalence classes of two-colorings 
under a different equivalence relation. Now we consider two colorings to be equiv- 
alent if one can be obtained from the other by first performing some geometric 
transformation corresponding to a symmetry of the bracelet, then possibly invert- 
ing all the colors. It is easy to check that this is indeed an equivalence relation. 
We can generalize this problem in the following way. Given a set of objects S, 
a set of colors R, a group G' acting on S, anda group H acting on R. Let C' denote 
the set of colorings of S' using the colors in R, so this is the set of all functions 
from S into R. We consider two colorings in C’ to be equivalent if one can be 
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obtained from the other by first applying a permutation from G on the objects, 
then applying a permutation from H on the colors. Exercise | asks you to verify 
that this does in fact form an equivalence relation on C’. We would like to know 
the number of equivalence classes of C' with respect to G and H. 

In our example with four-bead bracelets, we have that S is the set of vertices 
of a square, R = {g, s} for gold and silver beads, G = D4, and H = Sb, since 
we may either leave the beads unchanged, or swap them. A permutation 7 € G 
induces a map 7* on C in the usual way. For instance, if 7 is the 90-degree rotation 
(1234), then z*(gggs) = sggg. Similarly, a permutation p € H induces a map 
p* on C. For example, if p = (12) then p*(gggs) = sssg. 

Of course, if H is the trivial group consisting only of the identity permutation 
(1), then we can use the cycle index and the enumeration formula of Pélya to 
determine the answer. The Dutch mathematician Nicolaas Govert de Bruijn gen- 
eralized the method of Pélya for arbitrary color groups H, and we describe this 
theory here. The first step is computing the set of equivalence classes of C’ with 
respect to the object group G' which are invariant with respect to a given permu- 
tation of the colors. Our proof follows de Bruijn’s paper [69]. 


Theorem 2.11. Suppose S is a set of n objects, R = {yi,.-.,Ym} is a set of 
m colors, G is a subgroup of the symmetric group Sy, and p € Sim. Let Pg(x) 
denote the cycle index of G. Then the pattern inventory for the colorings of S 
which are nonequivalent with respect to the action of G on S, but invariant with 
respect to the action of p on R, is 


Fa. oly) = Pe(ar(p), a2(p), oes ,On(p)), (2.64) 


where 


k-1 
ane) = Yo [wiv 


pk(j)=j 10 
forl<k<n. 


Proof. Let C denote the set of all colorings of S, so C' is the set of maps from S 
into R. For a particular coloring c € C, let € denote its orbit with respect to the 
group G, so = {a*(c) : 7 € G}. Also, let v(c) = (ni, 2,...,%m), where for 
each 7 the integer n; records the number of elements of S assigned the color y; in 
c, and let y¥(° denote the monomial yi" y3? --- yn. Since v(1*(c)) = v(c) for 
any 7 € G, we may define yY by y¥© = yv, 

Suppose that c is invariant under the action of p, that is p*(c) € @. Since 
we want to find the pattern inventory for the classes of colorings of S' that are 
nonequivalent with respect to G,, but invariant with respect to p, we need to study 
the generating function 


Fe ,p(y) = ye yr, 
p(e)=e 
Since G is a group, it is straightforward to show that the set of all colorings that 
are invariant under p is the union of all the orbits € where p(¢) = ¢. Thus, using 
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Lemma 2.8 we find that 


Fooly) = > i -a > IGely, 


p*(c)Ee 


where G;, is the stabilizer of c in G. Now since p*(c) € @, there exists a permuta- 
tion 7- € G such that p*(c) = 7*(c). Also, the set of permutations {7.07 : 7 € 
G.} is exactly the same as the set {m7 € G: m*(c) = 12 (c)}, so |G-| equals the 
number of permutations in G which have the same effect as p on c: 


IGel = |{m € G: m*(c) = p*(o)} I. 
Let U,, denote the set of colorings c for which 7 and p have the same effect, 
U, ={cEC:n*(c) =p*(o}}. 


Note that if c € U, then automatically p*(c) € ¢. Thus, we find that 


1 
Fooly) = LY: (2.65) 


TEG ceUZ 


Now suppose 7 € G, and z has 4; cycles of length 7, foreach? with] <i<n. 
Let @; denote the length of the ith cycle (when 7 is written in cycle notation 
in the canonical way), and let s; denote the smallest element of the zth cycle. 
For example, ifn = 7 and 7 = (1245)(37)(6), then \y = Ax = Ay = 1, 
Ly 4, bo 2, £3 1, 54 1, s2 3, and s3 = 6. Also, let M;,(@1,---,2n) 
denote the monomial obtained from 7 as in (2.56), so in the example we have 
M;, (x1, seey x7) = %1XMQL4. 

Suppose that c € U,, so that applying 7 to c has the same effect as applying 
p. If position s; has color y; in c, it follows that position 7~1(s;) has color Yoj)> 


position 7~?(s;) has color y,2(;),..., position 7~‘%~1) (s;) has color y,¢, 1) 3) 
and we require that p*'(j) = j. It therefore follows that 


> yO = My (on(p), 02(p),-++,en(0)); 
c€U_, 


and the theorem follows by combining this with (2.57) and (2.65). O 


We can apply this theorem to our original example on bracelets, where n = 4, 
m = 2,G = D4, and p = (12). Write y: = g for a gold bead, and yz = s 
for a silver one. Then a;(p) = 0, since no color is left unchanged by p. Next, 
a2(p) = yiy2 + yoy: = 2gs, since p?(j7) = j for both j = 1 andj = 2. 
We then find that a3(p) = 0 since p3(1) = 2 and p?(2) = 1, and a4(p) = 
yiyoyiye + yoy1yey1 = 297s”. Using (2.58), we obtain then that 


F'p,,(12) (9, s) = Pp, (0, 29s, 0, 297s”) — Ag°s; 
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and we verify that there are indeed just two four-bead bracelets which are invariant 
under bead swapping, discounting rotations and flips: gsgs and ggss. 

If we introduce another type of bead in this example, say y3 = b for bronze, 
and keep p = (12), then we obtain aj(p) = b, a2(p) = 29s + b?, a3(p) = 6, 
and a4(p) = 297s? + b4, and we calculate that 


Fp, (12) (9, 8, 6) = Pp,(, 29s + b?, b*, 297s” + b*) = 2975? + 2gsb? + 0°. 


The five different configurations in this case are represented by ggss, gsqs, gbsb, 
gsbb, and bbbb. 

We may now use Theorem 2.11 to solve our original problem. We would like 
to obtain the pattern inventory for a set of colorings C' when we account for both 
a group of symmetries G on the objects, and a group H of symmetries on the 
colors. We compute this pattern inventory by averaging the patterns Fg ,(y) over 
all permutations p in 7, then combining the terms that correspond to equivalent 
patterns of colors. 


Theorem 2.12 (de Bruijn’s Enumeration Formula). Suppose S is a set of n ob- 
jects, R = {y1,.--,; Ym} is a set of m colors, G is a subgroup of the symmetric 
group S,, and H is a subgroup of Sm. Then the pattern inventory Fon(y) for 
the colorings of S which are nonequivalent with respect to both the action of G on 
S and the action of H on R is obtained by identifying equivalent color patterns 
in the polynomial 


1 
Fo.n(¥) = Ga SS Fooly); (2.66) 
pCH 
where F.p(y) is given by (2.64). 

We describe one example before providing the proof. With n = 4,m = 2, 

R= {g,s},G = Dg, and H = So, we compute 

Fp,,8.(9, 8) = 3(Po.(g + 5,9 + 87,9° + 8°, 9% + 5°) 
+ Pp, (0, 29s, 0, 297s”)) (2.67) 
= $(91 +: s*) + $(g°s + gs*) + 297s?. 

The color patterns g* and s* are equivalent under the color group H = S2, so we 
let [g*] denote either one of these patterns. Likewise, we let [g?s] denote either of 
the equivalent patterns g*s or gs®. The last pattern, gs, is not equivalent to any 


of the others, so we let [g?s?] designate this single pattern. We obtain the pattern 
inventory by combining the equivalent terms: 


Fp, 89 (9, s) = [9°] + [9°] + 2[g7s?]. 


Proof of Theorem 2.12. Let C' denote the set of all colorings of S using the colors 
of R, and let € denote the orbit of the coloring c under the action of G, so € = 
{x*(c) : « € G}. The group 4H acts on the set of equivalence classes {@ : c € C}, 
and we let @ denote the orbit of Z under this action, so 


é = {p* (0): p € HA}. 
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In the example above, if c = ggqs, then 


t= {9998, 9989, 9599, 8999} 


and 
t= {{9998, 9959, 9599, 8999}, {888g, 8898, 8gss, gsss}}; 


if c = gsgs, thent = {gsgs, sgsg} and ¢ = {{gsgs, sgsg}}. 
Employing the notation we introduced in the proof of Theorem 2.9, and using 
Lemma 2.8, we compute 


7 Fee) = Ge Db =a 


peH peH Tt 


=— So |Ady® (2.68) 


Since the color patterns in the set {yY“ : @ € G} are equivalent under H, we 
select one pattern from this set to represent the class ¢, and denote this equivalence 
class of patterns by [yY]. By replacing each term yY in the last line of (2.68) 
by its representative class [yY], we obtain the pattern inventory, 


Fonly) = Soy’). Oo 


Cc 
We can use Theorem 2.12 to determine the number different ten-bead pairs of 
bracelets in the OPPOSITES ATTRACT product line having a given configuration 
of colors. Since 
Pp,o (x) = 5 (aj? + 03 + 4a? + 49 + 523 + 5ax7x$) , 


we compute 


Fp i9,83(9)8) = $(Pod.o (g+s,97+87,...,g°+ 8!) 

+ Pp, (0, 29s, ...,0,29°s°)) 

= 3(9°° + 5°°) + 5(9°s + 95°) + 5(g°s? + 9?s°) 
+ 4(g's° + g°s") + 8(g°s* + g#s°) + 139s”, 


so the pattern inventory for these pairs of bracelet is 


Fp ,o,82(9, 8) = [9'°] + [9°s] + 5[9°s?] + 8[9’s°] + 16[g%s4] + 13[g°s°]. 
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The situation is much simpler if we need only compute the total number of dis- 
tinct colorings with respect to G and H, and we do not need the finer information 
provided by the pattern inventory. For this case, we need only set each y; = 1 in 
F.u(y), and there is no need to compute Fg, #(y). Since this case is so com- 
mon, we describe its solution as a corollary to Theorem 2.12. Its proof is left as 
an exercise. 


Corollary 2.13. Suppose S is a set of n objects, R is a set of m colors, G is 
a subgroup of the symmetric group S,, and H is a subgroup of S». Then the 
number of colorings of S using the colors in R which are nonequivalent with 
respect to both the action of G on S and the action of H on R is 


1 
Ne,u(n,m) = Tr S> Pa(1(p), B2(0),---,Bn(p)), (2.69) 
pe 
where By(p) = Dod jA;(p), with the sum extending over all the positive divisors 
j of k, and X;(p) is the number of cycles of p of length j. 
For example, for our ten-bead bracelet problem with m = 2 and H = Sb, 


we find that the only nonzero values of the \;(g) are Ai((1)(2)) = 2 and 
A2( (12) ) = 1. It follows that ;,( (1)(2) ) = 2 for 1 < k < 10, and 


2 if kis even, 


Be( (12) ) = f if k is odd. 


Therefore, 
ND,o,82(10, 2) = citgart (2,2,...,2) + Pp,,(0,2,...,90, 2)) = 44, 


Last, we return to the problem from earlier sections concerning twenty-bead 
necklaces using rhodonite, rose quartz, and lapis lazuli beads. Using H = ((123)), 
we find that 
3 if3|k, 


exam) = axcasay)= 45 SIE 
and (;((1)) = 3 for each &. Thus, 


NDa,C3 (20, 3) = 4(Ppzo (3, ---;3) + 2Pp.o(0, 0,3, ..-,0, 0,3, 0, 0)) 
= 1Pp,,(3,...,3) = 29076719, 


since none of the variables 73; appears in Pp,,(x). This then is the number of 
different 20-bead necklaces if we discount rotations, flips, and the bead substitu- 
tions rhodonite — rose quartz — lapis lazuli — rhodonite, or rhodonite — lapis 
lazuli — rose quartz — rhodonite. 

Using H = S3 instead, we obtain 


1 if kis odd, 
3 if k is even, 


pe (12)) = 6 (19)) = (29))= | 
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and so 


NDio,83(20, 3) = +(Ppao (8, ---,3) + 2P pao (0, 0,3, ...,0,0, 3,0, 0) 
+ 3Pp,(1,3,--.,1,3)) 
= ¢(87 230 157 + 63519) = 14548 946. 


This is therefore the number of different 20-bead necklaces if we discount rota- 
tions, flips, and any permutation of the bead types. 


Exercises 


1. Suppose that a group G' acts on a set S of objects, and a group H acts on 
a set R of colors. Let C denote the set of functions from S into R, that is, 
the number of colorings of S' using the colors in R. If c, and cg are two 
colorings in C’, write c; ~ cg if there exists an element g € G and an 
element h € H such that applying g to the underlying objects of c;, then h 
to its colors, produces cz. Show that ~ induces an equivalence relation on 


C. 


2. Suppose that G is a group acting on a set of objects S, and that C is the set 
of colorings of elements of S using the colors in a set R. Let € denote the 
orbit of c in C’ with respect to the action of G. Let p be a permutation acting 
on R. Prove that {c € C': p*(c) € t} =U ace 


3. Compute the number of different pairs of bracelets in the OPPOSITES AT- 
TRACT product line for n = 6,n = 7, andn = 8. 


4. Our jewelry company plans to extend their line of bracelets by introducing 
sets of m bracelets formed using m different colors of beads, so that a set 
may be shared among a group of m people. If one bracelet in a package 
has the coloring c, then the others in the package have the coloring p*(c), 
(p*)?(c), ..., (p*)™*(c), where p is the cyclic permutation (12---m). 
Use D,, for the object group G in each of the following problems. 


(a) Compute the number of different packages of bracelets form = 3 
when n = 6, then n = 7, thenn = 9. 


(b) Compute the number of different packages of bracelets form = 4 
when n = 10, then when n = 12. 


(c) Determine the pattern inventory BF Dn.Cm(Y) for the case m = 3 and 
n = 6, then n = 9. 


5. Compute the pattern inventory Fp, s, (x,y) forn = 6,n = 7, andn =8. 


6. Compute the pattern inventory Fo, 5, (x,y) forn = 6, n = 7, andn = 8. 
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7. Verify that the pattern inventory F Doo 83 (1; 7, !) for 20-bead necklaces with 
three kinds of beads, using the full symmetric group S3 for H, is 
Frpao,85(7, 91) = [2°] + [r?9q] + 10[r?8q?] + 10[r!8qi] + 33[r17q°] 

+ 90[r2? 71] + 145[r2%q4] + 489[r?%q3]] + 430[r*%q7I?] 

+ 406[r2>q°] + 1956[r1>q*1] + 3912[r** 9377] + 1032[r*q°] 

+ 5832[r4q°l] + 14724[r4q417] + 9924[r4q319] + 1980[r73q"] 

+ 13608[r'3q°l] + 40824[r'3q°1?] + 67956[r!3q419] + 3260[r12q°] 

+ 25236[r'2q7l] + 88620[r'*q°l?] + 176484[r12q°13] 

+ 111270[r22q414] + 4262[r14q°] + 37854[r!4q8]] 

+ 151416[r21q707] + 352968 [714413] + 529452[r21 4°14] 

+ 2518 [r20q"?] + 46252[r9q°l] + 208512[r7°q?1?| 

+ 554520[r2°q73] + 971292[r9q®l4] + 583784 [r°g? 1] 

+ 116398[r°q°1?] + 693150[r°q513] + 1386300[r°q"14] 

+ 1940568[r9q°1°] + 782141 [r8q81*] + 2494836 [r8q71°] 

+ 1458578[r8q°19] + 1665912[r7q71°]. 


8. Prove Corollary 2.13. 


9. Consider the symbol of the medical relief agency shown in Figure 2.12. 
Each of the twelve line segments outlining the cross shape must be colored 
red, green, blue, or yellow. 


(a) How many ways are there to design the symbol, if we consider two 
configurations equivalent if one can be obtained from the other by 
some combination of a rotation, flip, and color reversal? A color re- 
versal exchanges red and green, and exchanges blue and yellow. 


(b) How many of these configurations have the same number of edges of 
each color? 


(c) Repeat the first two problems, but this time consider two colorings to 
be equivalent if one can be obtained from the other by either exchang- 
ing red and green, or exchanging blue and yellow, or both. 


(d 


Ww 


Repeat the first two problems, but now consider two colorings to be 
equivalent if one can be obtained from the other by an iterate of the 
cyclic permutation red — green — blue — yellow — red. 


(e 


wm 


Suppose now that black is added as a possible color for a segment 
of the border. How many ways are there to design the symbol, if we 
consider two configurations equivalent if one can be obtained from 
the other by some combination of a rotation, flip, and color reversal? 
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A color reversal exchanges red and green, exchanges blue and yellow, 
and leaves black fixed. 


(f) Repeat the previous problem, but this time consider two colorings to 
be equivalent if one can be obtained from the other by either exchang- 
ing red and green, or exchanging blue and yellow, or both. 


10. Determine the number of ways to color the faces of a cube using the three 
colors maroon, cardinal, and burnt orange, if two colorings are considered 
to be equivalent if one can be obtained from the other by rotating the cube 
in some way in three-dimensional space, and possibly exchanging maroon 
and burnt orange. Then determine the number of such colorings in which 
maroon and burnt orange appear the same number of times. 


11. Determine the number of ways to color the faces of an octahedron using 
the four colors heliotrope, lavender, thistle, and wisteria, if two colorings 
are considered to be equivalent if one can be obtained from the other by 
rotating the octahedron in some way, and possibly exchanging heliotrope 
and lavender, or thistle and wisteria, or both. Then determine the number of 
such colorings in which the number of faces colored heliotrope matches the 
number colored lavender, and at the same time the number of faces colored 
thistle matches the number colored wisteria. 


2.8. More Numbers 


Truly, I thought there had been one number more... 
— William Shakespeare, The Merry Wives of Windsor, 
Act IV, Scene I 


Many questions in combinatorics can be answered by analyzing the number of 
ways to arrange a particular collection of objects into a number of bins, without 
regard to the order of placement. There are four basic kinds of problems of this 
form: The objects may be identical or distinguishable, and similarly for the bins. 
Problems of this form in combinatorics are called occupancy problems. 

We have already studied occupancy problems for the case of distinguishable 
bins. If the objects are identical, then we saw in Section 2.6.2 that the number 
of ways to distribute n objects among k bins is the binomial coefficient (pease : 
This is the same as the number of ways to select n objects from a set of & different 
objects with repetition allowed, and we described the correspondence between 
these two problems in the earlier section. On the other hand, if the objects are 
distinguishable, then the number of ways to distribute n objects among k bins is 
simply k” by the product rule, since each object can be placed in any of the bins. 
For example, consider the problem of determining the number of n-letter words 
that can be formed using an k-letter alphabet. We can model this as an occupancy 
problem by taking the integers between 1 and 7 as our objects, and the & letters of 
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the alphabet as our bins. Each placement of the objects in the bins corresponds to 
an n-letter word: The placement of 1 indicates the first letter, 2 the second letter, 
etc. Furthermore, it is clear that every possible n-letter word can be obtained in 
this way. 

In subsequent sections, we consider some occupancy problems where the bins 
are indistinguishable. We call the bins groups or piles in this case, with the un- 
derstanding that they are always unlabeled. The remaining two basic types of 
occupancy problems each produce important sequences of numbers in combina- 
torics. The problem of arranging a number identical objects into piles gives rise 
to partitions, which are studied in Section 2.8.1. The case of distributing a col- 
lection of distinguishable objects into groups produces the Stirling set numbers, 
discussed in Section 2.8.3, and the Bell numbers of Section 2.8.4. We also study 
two other important combinatorial sequences here: the Stirling cycle numbers in 
Section 2.8.2, and the Eulerian numbers in Section 2.8.5. Both of these are con- 
nected to the structure of permutations. 

We study some important properties of each of these classes numbers, aided 
by generating functions. We also introduce some different kinds of generating 
functions to assist with our derivations. Some analysis illuminates for instance 
some interesting connections between ordinary powers, rising and falling factorial 
powers, and binomial coefficients. 


2.8.1 Partitions 


Whew! Don’t try to eat these so-called chips! 
— Homer Simpson, after choking during a poker game, 
The Simpsons, episode 103, Secrets of a Successful Marriage 


Suppose a winning hand in poker nets you a pot of n identical poker chips, and 
you want to organize your winnings into a number of neat stacks, in order to 
intimidate your opponents. Individual stacks are not labeled or distinguishable in 
any way, except for the number of chips they contain, so an arrangement of chips 
simply corresponds to a collection of positive numbers that sums to n. How many 
ways are there to organize your winnings? 

An arrangement of n identical objects into a number of (unlabeled) piles is 
called a partition of the objects, so we want to know the number of partitions of 
the n objects, or, for short, the number of partitions of n. Let p,, denote this num- 
ber. We might also investigate the number of ways to divide n identical objects 
into a specific number k of piles. Let p,,,, denote this number. Since the piles 
are unlabeled, we can discount the possibility of an empty pile, so it follows that 
Pn = Pn, + Pn,2 +++ +Pn.n forn > 1. For example, Figure 2.13 exhibits the 
fifteen ways to divide n = 7 poker chips into stacks. Thus p7 = 15, and we see 
for instance that p73 = 4 and p74 = 3. Each configuration here is also displayed 
with a list showing the size of the stacks in descending order. We will always de- 
note partitions in this way. It follows that we can define p,, as the number of ways 
to write m as a sum of positive integers, with the summands listed in descending 
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(2, 1,1, 1, 1, 1) (1,.1,.1,-1,.1,:1,.2) 


FIGURE 2.13. The fifteen ways to stack seven poker chips. 


order. For example, the partitions of n = 4 are (4), (3,1), (2,2), (2,1,1), and 
(1,1,1,1). 
We first note some particular values for the p,,,,. As a special case, we set 


_ fi ifk=0, on 
POR V0 GRE, 


SO po = 1. Also, we set pr, = 0 for all k ifn < 0, so p, = 0 if n is negative. 
For n > 1, certainly there is just one way to write n using a single summand, and 
just one way using n summands, so py. = Pry = 1 for n > 1. Further, it is 
impossible to express a positive integer as a sum with zero terms, or more than n 
terms, or a negative number of terms, so we set 


Pnk =9, ifk <Oork>n. (2.71) 
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Thus, for each integer n we have the identity 


Pr = ye (2.72) 
k 


is a partition of n, with the summands in descending order. If a, = 1, then 
(a1,.--,@k—1) is a partition of n — 1, and every partition of n — 1 into k — 1 
parts can be obtained in this way. Thus, the number of partitions of n into k parts, 
where the smallest part is 1, is precisely pn—1,,-1. Suppose then that a, > 2. In 
this case, we see that (a; — 1,...,a, — 1) is a partition of n — k into exactly 
k; parts, and every partition of n — k can be obtained in this way. It follows that 
the number of partitions of n into & parts, where the smallest part is at least 2, is 
Pn—k,k- Lherefore, we find that 


Pn,k = Pn-1,k-1 of Pn—k,k (2.73) 


for n > 1. This recurrence relation, together with the initial condition poo = 1, 
allows us to compute the value of p,,,, for any n and k. A table of these values 
for n < 10 appears in Table 2.3. 


Pnk|kK=O 1 2 3 4 5 6 7 8 DY 10] py 


n=0 


— 
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TABLE 2.3. Number of partitions p,; of n into k parts, and the number of partitions pn 
of n. 


We would like to determine a more efficient way of computing p,,, without 
using (2.73) to determine all of the p,,;,. In order to do this, we first introduce a 
useful way to visualize a partition known as a Young diagram. The Young diagram 
of a partition (a1,...,a%) of n consists of n boxes arranged in k rows, with a; 
boxes in the top row, ag boxes in the second row, and so on, and each row is 
aligned on the left. For example, Figure 2.14(a) illustrates the Young diagram for 
the partition (6, 4,4, 2,1) of nm = 17. This is then much like our stacks of poker 
chips of Figure 2.13, only turned sideways. 

Many texts use arrays of dots instead of arrays of boxes for illustrating par- 
titions, and in this case the diagrams are known as Ferrers diagrams. We find 


2.8 More Numbers 221 


the Young diagrams more convenient to use. (Young diagrams earned a distinct 
name due to their use in visualizing more complicated structures known as Young 
tableaux, where the boxes are filled with integers according to particular rules.) 


(a) \ = (6,4, 4,2, 1). (b) V’ = (5,4,3,3,1,1). 
FIGURE 2.14. The Young diagram for a partition A, and its conjugate ’. 


We now define the conjugate \' of a given partition of n as the partition of n 
obtained by counting the stacks of boxes in the columns of the Young diagram for 
. For example, that the conjugate partition of \ = (6,4, 4, 2, 1) in Figure 2.14(a) 
is = (5,4,3,3,1,1). The diagram for X’ is displayed in Figure 2.14(b). Also, 
the conjugate of the partition of n that consists of all 1s is the trivial partition (7). 

Clearly, different partitions cannot have the same conjugate, and every partition 
is the conjugate of some partition, so the conjugation mapping is a permutation on 
the set of partitions of n. This fact is very useful in establishing properties of the 
numbers p,,,; and p,,. For example, it is immediate that the number of partitions 
of n which have largest summand a; = k is simply p,,%, since conjugating the 
partitions with this property yields precisely the set of partitions of n into exactly 
k parts. 

Next, we consider some generating functions. From our work on the money- 
changing problems of Section 2.6.3, we know that the generating function P(x) 
for the sequence p,, is given by an infinite product, 


P() = II — (2.74) 
k>1 . 
Let ®(x) = 1/P(a), so 
G(x) = |] -2*). (2.75) 
k>1 


Then ®(.) is itself the generating function for some sequence {c,, }. If we imagine 
expanding enough terms of this product to determine c,,, we see that each partition 
(a1,...,@x) of n into distinct parts a; > --- > az contributes (—1)* to c,, and 
these terms determine c,,. Define g.(n) to be the number of partitions of n into an 
even number of distinct parts, and let g,(n) be the number of partitions of n into 
an odd number of distinct parts. It follows that c, = qe(n) — go(n), and so 


®(x) = So (ge(n) = go(n))2”, (2.76) 


n>=0 
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with the understanding that g-(0) = 1 and q,(0) = 0. 
By expanding a number of terms of the product for ®(x), we can compute the 
values of these coefficients up ton = 100: 


@(2) =1l-2—-2 tate! — 2? — 2 +277 + 7% 
og gf a gl we BT Ag Kg? 48? a 100 
Thus, it appears that g-(n) and q,(n) are often equal, and moreover differ by at 
most 1. Euler first established this fact; the proof we exhibit here employs Young 


diagrams and is due to Franklin in 1881 [111]. The reason for the curious name 
of this theorem is explored in Exercise 7. 


Theorem 2.14 (Euler’s Pentagonal Number Theorem). Let n be a nonnegative 
integer, and let qe(n) and qo(n) be defined as above. Then 


(-1)* ifn= SOE 
Je(n) — do(n) = . otherwise. 
Proof. Let A be a partition of n into distinct parts. Let s(A) denote the smallest 
part of A, and let r(A) be the number of consecutive integers in A, starting with 
its largest part. In the Young diagram for X, the number of squares on the bottom 
row is (A), and r(A) is the number of boxes in the diagram that lie on a 45° line 
anchored at the rightmost box. For example, Figure 2.15(a) exhibits a partition of 
n = 23 into five distinct parts. Here r(\) = 3 and s(A) = 2, and the relevant 
boxes for these quantities are marked respectively with x’s and +’s. 


(a) A = (7,6, 5, 3, 2). (b) pp = (8, 7,5, 3). 
FIGURE 2.15. Constructing 4. when s(A) < r()). 


We aim to transform A into another partition yw of n with distinct parts. The 
number of parts of 4 will be either one more or one less than the number of parts 
of A, so one of these two partitions will have an even number of parts, and the 
other will have an odd number. The transformation is described in terms of the 
Young diagram for , and depends on the relative sizes of r(\) and s(A). 

If s(A) < r(A), then we move the boxes in the bottom row of the Young dia- 
gram for \ to the ends of the top s(\) rows of the diagram. Figure 2.15(b) shows 
the resulting partition jz obtained from the partition \ of Figure 2.15(a). On the 
other hand, if s(A) > r(A), then we move the rightmost boxes of the top (A) rows 
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of the diagram for to make a new row at the bottom of the diagram. Figure 2.16 
shows this procedure for A = (9,7,5, 2), yielding w = (8,7,5,2,1). 


a ge 


(a) \ = (9,7, 5, 2). (b) pw = (8, 7,5, 2,1). 
FIGURE 2.16. Constructing 4, when s(A) > r()). 


The procedure for creating ju from fails in some special cases. The first case 
breaks down precisely when s(A) = r(A) and the corresponding boxes in the 
Young diagram overlap, as in Figure 2.17(a). In this case, writing k for r(A), we 
compute that the total number of boxes in the diagram is 


k(3k — 1) 
—<—. 


The second case fails precisely when s(\) = r(A) + 1 and the boxes overlap, as 
in Figure 2.17(b). Again writing k for r(A), we find that 


2k 


_ a k(3k +1) 
i 2a 
j=k+1 
in this case. 
PT tTtTttN 
ERE 
(a) \ = (7,6, 5,4). (b) A = (6,5, 4). 


FIGURE 2.17. Exceptional partitions. 


Since our mapping on Young diagrams is its own inverse (see Exercise 6), it 
follows that it defines a bijection between the set of partitions of n into a distinct 
odd number of parts, and the set of partitions of n into a distinct even number 
of parts, provided that n # k(3k + 1)/2. When n is one of these exceptional 
values, there is exactly one additional partition with an even number of parts if k 
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is even, and exactly one extra partition into an odd number of parts if & is odd. 
The statement then follows. O 


By combining (2.76) with Theorem 2.14, we see that 


®(2) =1+ So(-1 (ce ue + pers ; (2.77) 
k>1 
and so 
1 = P(x)®(x) 
= S> pea® 14+ yay Ce 4 penny) (2.78) 
k>0 k>1 


It follows that the coefficient of x” on the right side of (2.78) is 0 for n > 1. We 
therefore immediately obtain the following result. 


Theorem 2.15. Let n be a positive integer. Then 


Dn + So(-1 (Pn—k(3k—1)/2 + Pn—k(3k-+1)/2) = 9, 
k>1 


that is, 


Pn = Pn-1 + Pn-2 Pn—-5 Pn-7 + Pn—-12 + Pn-15 PERS og (2.79) 


We can now use (2.79) to determine values of p,, without using the recurrence 
(2.73) for the py,,.. For example, using the values of p,, computed in Table 2.3, 
we compute 


Pil = Pio + Po — Po — pa = 42 + 30-11 —-5 = 56, 
then 
Piz = Pil + Pio — P7 — Ps + po = 564+ 42 —15-7+1=77, 


and so on. Table 2.4 displays the values of p,, computed in this way up ton = 50, 
where 


P50 = Pag + Pag — Pas — Paz + p3g + P35 — P2g — P2a + Pis + pio = 204 226. 


We close this section with another interesting fact about the partition sequence. 
In 1918, Hardy and Ramanujan [152] established a remarkable nonrecursive for- 
mula for p,, as the value of a certain convergent series. Their formula was refined 
by Rademacher in 1937 [230]. We do not reproduce this formula here, but we 
mention only that it involves the number 7, a certain complex root of the polyno- 
mial «2* — 1, and the hyperbolic sine function. From this formula, however, one 
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n n Pn 

11 41 44583 
12 42 53174 
13 43 63261 
14 44 75175 
15 45 89134 
16 46 105558 
17 47 124754 
18 48 147273 
19 49 173525 
20 50 204226 


TABLE 2.4. The number of partitions of n. 


can obtain information on the rate of growth of the sequence p,,. Asymptotically, 
the number of partitions of n satisfies 


er V 2n/3 
Pn AnV3 ’ 


where a, ~ b,, means that lim, —.s. Gn/bn = 1. See the book by Andrews [9] for 
the details and a proof, as well as much more information on this rich topic. 


(2.80) 


Exercises 
1. Establish formulas for pp .2, Pnj—1, and Py n—2- 


2. Use (2.73) and Table 2.3 to compute the values of pi1.%, Di2,k, and piz.k 
for each k. 


Use (2.79) and Table 2.4 to compute the value of ps1, then ps2. 
Use Young diagrams to prove that go(n) equals the number of partitions 
of n which are invariant under conjugation, that is, for which A = 2’. 


5. Use generating functions to prove that the number of partitions of n into 
distinct parts equals the number of partitions of n where each part is odd. 


6. Suppose that A is a partition of n, and that A is not an exceptional partition 
like those shown in Figure 2.17. Let jz be the partition obtained by apply- 
ing the procedure described in the proof of Theorem 2.14 on A. Show that 
r(w) < s(p) if and only if r(A) > s(A). Then conclude that this map de- 
fines a permutation on the set of non-exceptional partitions of n into distinct 
parts, and that this permutation is its own inverse. 


7. (a) Show that (2.77) may be written more simply as 


®(z) = Pe ae 


k 


226 


8. 


10. 
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(b) The kth pentagonal number c;, is the number of disks in a pentagonal 
shape formed by stacking a triangular arrangement of 1 + 24 ---+ 
(k — 1) disks atop a square arrangement of k x k disks, as shown in 
Figure 2.18. Determine a closed fomula for the /th pentagonal num- 
ber. Why is Theorem 2.14 called the Pentagonal Number Theorem? 


FIGURE 2.18. Pentagonal numbers: a1 = 1, a2 = 5, and ag = 12. 


Let s,,,, denote the number of partitions of m whose smallest element is k, 
SO Pn = 8n,1 + 8n,2 +°+*+ + n,n. Prove that 


Pn-1 ifk = 1, 
Snjk = : 
Sn—1,k-1 — 8n—k,k-1 tf k > 2. 


Then use this recurrence, together with the base values s,,, = O fork > n 
and po,o = 0, to produce a table of values for the s,,, for 1 <n < 10, 
similar to Table 2.3. 


Prove that p, < Pn—1 + Pn—z2 for n > 1 by considering first the number 
of partitions of n that have at least two parts equal to 1, then the other 
partitions. Then use this to establish that p,, < F,,41 forn > 0, where Fj, 
denotes the kth Fibonacci number. 


A composition of nis a list of positive integers (a1, a2,...,@,) whose sum 
is n, where the order of the integers matters. For example, there are four 
different compositions of n = 3: (3), (2,1), (1,2), and (1,1,1). Let c, 
denote the number of compositions of 7, and let c,,,;, denote the number of 
compositions of n into exactly k parts. 


(a) Compute the value of c,,, for each k and n with 1 < k < n and 
1 <n < 5 by listing all the compositions, and then calculate the 
value of c, forl <n <5. 


(b) Using these examples, conjecture formulas for c,,, and cp, for ar- 
bitrary positive integers n and k. Then prove that your formulas are 
correct. 
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2.8.2 Stirling Cycle Numbers 


The Round Table soon heard of the challenge, and of course it was a 
good deal discussed... 
— Mark Twain, A Connecticut Yankee in King Arthur’s Court 


Suppose King Arthur decides to divide his knights into committees in order to 
better govern Britain. True to his egalitarian nature, he crafts k identical round 
tables for this purpose. How many ways are there to seat n knights at these tables, 
if each table can seat any number of knights, and no table can be empty? Here, we 
count two seating arrangements as different only if some knight has a different 
neighbor on his left side (or his right) in each one. Since the tables are identical, 
the particular table occupied by a group of knights is immaterial. Thus, once a 
group of knights is assigned to a table, we must account for all the possible seating 
arrangements there. From Section 2.7.2, we know that there are (m— 1)! different 
ways to seat m people at one round table. 

Let us represent the n knights by the integers 1 through n, and denote the 
seating of knights Ky, Ko, ..., Ky», in clockwise order around one table by 
(Kk) Ko... Ky). Of course, (K2K3...K,K 1) denotes the same arrangement 
of knights around the table, so to make our notation unique we demand that the 
knight represented by the smallest number appear first in the list. An arrange- 
ment of knights at the & tables is then uniquely represented by a list of k strings 
of integers in parentheses, where each integer between 1 and n appears exactly 
once. For example, with six knights and three tables, we might seat knights 1, 3, 
and 5 in clockwise order around one table, knights 2 and 6 at another table, and 
knight 4 alone at the third table. This arrangement is denoted by (135)(26)(4). 
This is precisely the cycle notation we used to describe a permutation on six ob- 
jects. We see that each seating arrangement of n knights at / tables corresponds to 
a unique permutation 7 € S,, having exactly k& cycles, and every such permutation 
corresponds to a unique seating arrangement. 

We define the Stirling cycle number, denoted by al , to be the number of ways 
to seat n knights at k identical tables, or, equivalently, the number of permutations 
am € S,, having exactly k cycles. These numbers are also known as the signless 
Stirling numbers of the first kind. A signed version of these numbers is also often 
defined by 


s(n, k) = (—1)"-* 4 (2.81) 


but we will employ only the signless numbers here. 
We derive a few properties of the Stirling cycle numbers. First, it is impossible 
to seat n knights at zero tables, unless there are no knights, so 


n 1 ifn=0 
=— > 2.82 
A f ifn > 0. ( ) 
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Second, if there is only one table, then 


I") =(n—-1)!, n>1. (2.83) 


Next, if there are n tables, then each knight must sit at his own table, and if there 
are n — | tables, then one pair of knights must sit at one table, and the others must 
each sit alone. Thus 


n 


1-0 es 


There are no arrangements possible if there are more tables than knights, or a 
negative number of tables, so 


|" =e (2.84) 


and 


i =0 ifk<Oork>n. (2.86) 


Further, because of the correspondence between seating arrangements and per- 


mutations, we have 
S- i =H (2.87) 


k 

Consider now the case n = 4 and k = 2. Suppose one knight, delayed by an 
armor adjustment, picks his place after the first three knights are already seated. 
If the first three knights are seated at one table, then the last knight must sit at the 
second table by himself. The number of arrangements in this case is the number 
of ways to seat the first three knights at one table, so Gl = 2. On the other hand, 
if two of the first three knights occupy one table, and the third sits at the second 
table, then the last knight may then either join the single knight, or the table with 
two knights. There are two possibilities in the latter case, since the fourth knight 
may sit on the left side of either of the knights already at the table. Thus, there 
are 3\;| = 9 possibilities in this case, and we find that (3 = 3[3] + Ed = 
11. Figure 2.19 shows these eleven arrangements when Tristram joins Bedivere, 
Lancelot, and Percival at two tables. 

This technique generalizes to produce a recurrence relation for these numbers. 
To seat n knights at & tables, we can first seat n — 1 knights at k — 1 tables, then 
seat the last knight alone at the kth table. Alternatively, we can seat the first n — 1 
knights at & tables, then insert the last knight at one of these tables. This knight 
must sit on the left side of one of the other n — 1 knights, so there are n — 1 
different places to seat the last knight. Therefore, 


=@-a[ | + [era] n>. (2.88) 
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FIGURE 2.19. Seating Bedivere, Lancelot, Percival, and Tristram at two tables. 


We can use this formula to compute a triangle of Stirling cycle numbers, just as 
we used the addition identity for binomial coefficients to obtain Pascal’s triangle. 
These computations appear in Table 2.5. 

Recall that for fixed n the generating function for the sequence of binomial 
coefficients has a particularly nice form: 5°, id a* = (x +1)". We can use the 
identity (2.88) to obtain an analogous representation for the sequence of Stirling 
cycle numbers. Let G,(x) = >>, [7] #*. Clearly, Go(x) = 1, and for n > 1, 


=(n-1))> a> ale 


k 
= (n—1)Gy_-1(x) + &Gpn_-1(2), 


so G(x) = (w+n—1)Gp_1(2). It is easy to verify by induction that this implies 
that G,,(x) = x(@+1)(24+ 2)---(a@+n—1) =2”. Thus, 


a=) > | ak, (2.89) 
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| | e&=0 1 2 3 4 5 6 7 8 n 
n=0 i i 
1 0 1 1 
2 0 1 1 2 
3 0 a 3 1 6 
4 0 6 11 6 1 24 
5 0 24 50 35-10 1 120 
6 0 120 274 225 8 15 1 720 
ic 0 720 1764 1624 735 4175 21 1 5040 
8 0 5040 13068 13132 6769 1960 322 28 1 | 40320 


TABLE 2.5. Stirling cycle numbers, (7 : 


for n > 0. Therefore, the Stirling cycle numbers allow us to express rising fac- 
torial powers as linear combinations of ordinary powers. Exercise 7 establishes a 
similar connection for the falling factorial powers. 


Exercises 
1. Use (2.88) and Table 2.5 to compute the values of Fl and bal for each k. 


2. Prove that 


1 ifn=0, 
Hei] = -1 ifn=1, 
k 0 ifn>2. 


3. Use a combinatorial argument to show that 


n—-1 


3 Saar 


4. Use a combinatorial argument to determine a simple formula for [tal : 
5. Use a combinatorial argument to show that 
n+1 "[n-k], 
m m—1 
k=0 
for nonnegative integers n and m. 


6. Prove that if n and m are nonnegative integers then 


ei) ala Ga 
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7. Prove that if n > 0 then 
n n—-k|T} k _ k 
a2 = 5 “(-1) He = pee (2.90) 
8. Use (2.89) to prove that if n > 0 then 


> Hl yr Fk = Ila + ky). 
k=1 


k 


Then use this to prove that | equals the sum of all products of n — k 


distinct integers selected from {1,...,—1}. For example, [3] = 1-2-3+ 
1-2-44-1-2-54+1-3-441-3-5+1-4-54+2-3-44+2-3-54+2-4-5+3-4-5 = 225. 


9. Let d(n,k) denote the number of ways to place n knights at k identical 
tables, with at least two knights at each table. For example, Figure 2.19 
shows that d(4, 2) = 3. Set d(0,0) = 1. 


(a) Use a combinatorial argument to show that d(n, i) satisfies the recur- 
rence relation 


d(n,k) = (n—1)(d(n —1,k) + d(n —2,k —1)) 


forn > 1. 


(b) Compute the table of values of d(n,k) for0 < n < 8, similar to 
Table 2.5. 


(c) Prove that if n > 0 then 
‘> d(n, k) = nj, 
k 


where nj denotes the number of derangements of n. 


2.8.3 Stirling Set Numbers 


36 (Roger Federer, 2006-07), 35 (John McEnroe, 1984), 

26 (Stefan Edberg, 1991-92), 25 (Ilie Nastase, 1972-73). 

Most consecutive sets won in Grand 
Slam matches in men’s tennis 


How many ways are there to divide n guests at a party into exactly & groups, if we 
disregard the arrangement of people within each group? Rephrased, this problem 
asks for the number of ways to partition a set of n objects into exactly k nonempty 
subsets, so that each element in the original set appears exactly once among the 
k subsets. For example, there are three ways to partition the set {a,b,c} into 
two nonempty subsets: {a,b}, {c}; {a,c}, {b}; and {b,c}, {a}. There is just one 
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way to partition {a, b,c} into one subset: {a, b, c}, and just one way to partition 
{a, b, c} into three subsets: {a}, {b}, {c}. 

The number of ways to divide n objects into exactly / groups is denoted by 
te Thus, {e\ = 3, and {3} — a = 1. These numbers are called the Stirling 
set numbers, or the Stirling numbers of the second kind. The notation S(n, k) is 
also often used to denote these numbers. 

We begin by listing a few properties of these numbers. First, form > 1 we have 


(}-{)- cm 


since there is only one way to place n people into a single group, and only one 
way to split them into n groups. Second, 


n 1 ifn=0O, 
= 2.92 
{ot f ifn > 0, ‘ 
since one cannot divide n people into zero groups, unless there are no people. 


Third, to divide n people into n — 1 groups, we must pick two people to be in one 
group, then place the rest of the people in groups by themselves, so 


(2-0) cn 


{i =0, ifk<Oork>n. (2.94) 


Next, we set 


k 


Also, the Stirling cycle number [7] distinguishes among the different ways to 


arrange n people within k groups, and the Stirling set number a does not, so 


n n 
= 2.95 
thle ass 
for alln > O and all k. 


We now derive a recurrence relation for ae Suppose we plan to divide n > 1 
people into k groups for a party, and we know that one person will arrive late. We 
could divide the first n — 1 people into & — 1 groups, then place the last person 
in her own group when she arrives, or we can arrange the first n — 1 people into 
k groups, then pick a group for the last person to join. There are esa different 
ways to arrange the guests in the first case, and awe different possibilities in 
the second. Therefore, 


n n—-1 n—-1 
{iba k extent; > he (2.96) 


For example, to partition the set {a, b, c, d} into two subsets, we can place d in its 
own set, yielding {a, b,c}, {d}, or we can split {a, b, c} into two sets, then add d 
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to one of these sets. The latter possibility yields the six different partitions 


{a, b, d}, {c}; {a, c,d}, {b}; {b, c,d}, {a}; 
{a,b}, {c,d}; {a,c}, {b,d}; {b, c}, {a, d}; 


and {4} = 200} +} =7. 


(2.97) 


Using identity (2.96), we can generate the triangle of Stirling set numbers 
shown in Table 2.6. The sequence {b,,} that appears in this table as the sum across 


the rows of the triangle is studied in the next section. 


7 6 1 
25 10 1 

31-90 65 15 1 

63 301 350 140 21 il 
127 966 1701 1050 266 28 ~ 1 | 4140 


CONnoakwNnrF O|” 
oS & &S OOo Oi) & 
ro 

oo 

ou 


TABLE 2.6. Stirling set numbers, {iz}. and Bell numbers, by. 


Exercise 8 analyzes the generating function for the sequence of Stirling set 
numbers ea with n fixed. We can obtain a more useful relation, however, if we 
replace the ordinary powers of x in this generating function with falling factorial 


powers. For fixed n, let 


role) = (ee tea) 


(2.98) 
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Therefore, the Stirling set numbers allow us to express ordinary powers as com- 
binations of falling factorial powers. 

We can derive another useful formula by considering the generating function 
for the numbers {it with f; fixed. Let 


Ne) 


and therefore 


H,(z) = (2.99) 


(1 —2)(1—22)---(1—ka) 
Next, we use partial fractions to expand this rational function. Our calculations 


are somewhat simpler if we multiply by &! first, so we wish to find constants Aj, 
Ao, ..., Ap such that 


Clearing denominators, we have 
k m—1 k 
kia’ = N° Am [[ Q-i2) [[ G-3Ja), 
m=1 j=l j=m+1 
and setting « = 1/m, we obtain 


k! m-1 j k j 
se~ 4m 1 (1-4) TL (-4). 
j=m+1 


j=l 
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sO 


and 


Thus 


and therefore 


n eee k 

i = So(-ye™ (Em (2.100) 
m=0 

for any nonnegative integers n and k;. This produces a formula for the Stirling set 

numbers. For example, we may compute {3} = 4,(3-1° —3-2%+1-3°) = 90. 


Exercises 


1. Use (2.96) and Table 2.6 to compute the values of {oy and tt for each k. 


2. Ahungry fraternity brother stops at the drive-through window of a fast-food 
restaurant and orders twelve different items. The server plans to convey the 
items using either three or four identical cardboard trays, and empty trays 
are never given to a customer. Use (2.96) and your augmented table from 
Exercise | to determine the number of ways that the server can arrange the 
items on the trays. 


3. Use combinatorial arguments to determine simple formulas for ae and 
ee 2 } : 
4. Anew casino game takes ten ping-pong balls, each labeled with a different 


number between 1 and 10, and drops each one at random into one of three 
identical buckets. A bucket may be empty after the ten balls are dropped. 
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(a) Suppose a bet consists of identifying which balls have landed together 


(b 


wm 


in each bucket. For example, a bet may state that one bucket is empty, 
another has just the balls numbered 2, 3, and 7, and the rest are in the 
other bucket. How many bets are possible? 


Suppose instead that a bet consists of identifying only the number 
of balls that land in the buckets. For example, a bet might state that 
one bucket is empty, another has three balls, and the other has seven. 
The numbers on the balls have no role in the bet. How many bets are 
possible? 


5. How many different fifty-character sequences use every character of the 


26-letter alphabet at least once? More generally, how many ways can one 
place n distinguishable objects into & distinguishable bins, if no bin may be 
empty? 


6. Use (2.99) to prove that a equals the sum of all products of n —k integers 


selected from {1,...,k}. For example, {3} =1-1-141-1-241.-1.- 
341-2-241-2-341-3-342-2-242-2-342-3-343-3-3= 90. 


7. Let r;,,, denote the number of ways to divide n people into & groups, with 


at least two people in each group. For example, the list (2.97) shows that 
r4.2 = 3. Set T0,0 = 1. 


(a) Use a combinatorial argument to show that r,,,, satisfies the recur- 


rence relation 
Tn.ko= krn—1,k ae (n = 1)rn—2,h-1 


forn > 1. 


(b) Define r,, forn > 0 by ry, = a Tn,k- Compute the table of values 


of rn, and r,, for0 <n < 8, similar to Table 2.6. 


(c) Determine a formula for r2,,,,, for a positive integer n. 


(d) A rhyming scheme describes the pattern of rhymes in a poem. For 


example, the rhyming scheme of a limerick is (a, a,b, b, a), since a 
limerick has five lines, with the first, second, and last line exhibiting 
one rhyme, and the third and fourth showing a different rhyme. Also, a 
sonnet is a poem with fourteen lines. Shakespearean sonnets have the 
rhyming scheme (a, b, a, b, c,d, c,d, e, f,e, f, g, g); many Petrarchan 
sonnets exhibit the scheme (a, b, b, a, a,b, b, a,c, d, e, c,d, e). Argue 
that r,, counts the number of possible rhyming schemes for a poem 
with n lines, if each line must rhyme with at least one other line. 


8. LetGn(x) = >>, {7 }2*, so Go(x) = 1. Show that G(x) = 2(Gn_i(x)+ 
/,-1(2)) for n > 1, and use this recurrence to compute G4 (x). 
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9. Show that 
A as 
a —1)"-* 2". 2.101 
w 2 { ay ee (2.101) 
10. Use (2.90) and (2.98), or (2.89) and (2.101), to prove the following identi- 
ties. 


n\| {k 1 ifn=m 
=i) = : 2.102 
Ss A tah ) i otherwise. ( ) 


k 
Eb le] coe = reali (2.103) 
7 kJ} |m 0 otherwise. 


11. Prove that ‘ 
oe oe n kia” 
re 
k>0 k 


for any nonnegative integer n. 


12. Suppose {ri,...,r¢} and {s1,...,s¢} are two sets of positive integers, 
f(x) = am (x"3 — x7), and N is a positive integer. Prove that 


for every n with 1 < n < N if and only if f)(1) = 0 for every n with 
1<n<N.Here, f(")(x) denotes the nth derivative of f(x). 


For example, select {1,5,9,17,18} and {2,3,11, 15,19} as the two sets, 
and select N = 4. Then1+5+9+17+18=2+3+4+114+154+19= 
50, 17+ 52+ 92 +177 + 18? = 2? + 32 + 112 + 15? + 19? = 720, 
13 + 53 4+ 9% +179 + 18° = 2? + 33 + 113 + 15% + 19% = 11600, and 
144+54+494+174 + 184 = 24+ 34+ 114 + 154 + 194 = 195684; and 
f(a) = 2-2? 405-23 4-09-24 4917 —g15 4918 — 2! has f( (1) = 0 
forl<n<A4. 


2.8.4 Bell Numbers 


Silence that dreadful bell: it frights the isle... 
— William Shakespeare, Othello, Act II, Scene III 


The Bell number b,, is the number of ways to divide n people into any number of 
groups. It is therefore a sum of Stirling set numbers, 


bn = >~ i (2.104) 
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The first few values of this sequence appear in Table 2.6. 

We can derive a recurrence relation for the Bell numbers. To divide n people 
into groups, consider the different ways to form a group containing one particular 
person. We must choose some number & of the other n — 1 people to join this 
person in one group, then divide the other n — 1 — k people into groups. It follows 


that , 
n- 
bn = bn—1—k: 
ieee 


k 
Reindexing the sum by replacing k with n — 1 — k, then applying the symmetry 
identity for binomial coefficients, we find the somewhat simpler relation 


n—-1l 
n= ( Z on n>1. (2.105) 


Rather than analyze the ordinary generating function for the sequence of Bell 
numbers, we introduce another kind of generating function that is often useful 
in combinatorial analysis. The exponential generating function for the sequence 
{a} is defined as the ordinary generating function for the sequence {a,,/n!}. For 
example, the exponential generating function for the constant sequence a,, = c is 
noo ce" /n! = ce®, and for the sequence a,, = (—1)"nl, it is 1/(1 + z). The 
exponential generating function for the sequence of Bell numbers is therefore 


Eta) = )_ Bn an (2.106) 


bk ontk 
= a +k 


k>0n>0 


-(Sa)(See) 


e* E (2). 


l| 
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Therefore, 
(In E(2))' =e°, 
and so 
In E(x) = e* +c 
for some constant c. Since E(0) = bo = 1, we must have c = —1. Thus, 


E(x) =e° —!. (2.107) 


We can use this closed form to determine a formula for b,,. Using the Maclaurin 
series for the exponential function twice, we find that 


E(x) = -e® 


Therefore, 


1 kn 
by = — > 7 (2.108) 


This formula is sometimes called Dobirski’s formula [79]. 


Exercises 
1. How many ways are there to put ten different dogs into pens, if each pen 
can hold any number of dogs, and every pen is exactly the same? 
2. Determine a closed form for the exponential generating function for each 
of the following sequences. 
(a) ay, = c*, with c a constant. 
(b) ax = 1if k is even and 0 if k is odd. 
(c) ak = k. 
(d) ax, = k”, for a fixed nonnegative integer n. The number of terms in 
the answer may depend on n. 
3. Verify that equation (2.108) for b,, produces the correct value for bo, b1, 
and bo. 
4. Show that the series in equation (2.108) converges for every n > 0. 
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5. Use a combinatorial argument to show that 
(HEC 
mf k m—1 
k 
7 k m—1fJ’ 
k 


for n > 1, and use this to derive the recurrence (2.105) for Bell numbers. 


(2.109) 


6. Define the complementary Bell number bn forn > 0 by 


Wilf asked if Dn = 0 for infinitely many n, or if there even exists an integer 
n > 2 where b,, = 0. The first few complementary Bell numbers are 1, —1, 
0, 1,1, —2, —9, —9, 50, 267, 413, —2180, —17731, —50533, and 110176. 


(a) Describe a combinatorial interpretation of On: 


(b) Use (2.109) to determine a recurrence for the complementary Bell 
numbers. Then determine a closed form for their exponential generat- 
ing function, E (a). How is this function related to the function E(x) 
of this section? 


(c) Use E (x) to determine a formula for bn, similar to the expression 
(2.108) for by. 


It is known that the sequence Bn changes sign infinitely often, and that Bn F 
O for almost all values of n. See Yang [289] and de Wannemacker, Laffey, 
and Osburn [71] for more information on this problem. 


7. Suppose P(x) is the exponential generating function for the sequence {p,, }, 
and (Q(x) is the exponential generating function for {q,,}. Prove that the 
product P(x)Q(a) is the exponential generating function for the sequence 


{yok (Peta eh 


8. Let r,, denote the number of rhyming schemes for a poem with n lines, 
if each line must rhyme with at least one other line, as in Exercise 7d of 
Section 2.8.3. Recall that ro = 1. 


(a) Prove that 
n—2 n—1 
m= ("y ) re 
k=0 


(b) Determine a closed form similar to (2.107) for the exponential gener- 
ating function R(«) for the sequence {rp }. 
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(c) Use this generating function, together with Exercise 7, to show that 
n 
= —1)"*b,. 
m= (f)our es 


(d) Prove that the number of rhyming schemes for n + 1 lines in which 
each line rhymes with at least one other line equals the number of 
rhyming schemes for n lines in which at least one line rhymes with 
no other line. Note that b,, is the total number of rhyming schemes on 
n lines, including schemes where some lines rhyme with no others. 


9. Let F(a) denote the exponential generating function for the sequence of 
Stirling cycle numbers with k; fixed, 


n>0 
Prove that ie 
Ej(2) = 21), 
1-2 


for k > 1, and use this to derive a closed form for F(x), 
nja"™ (-1)* k 
» Hl = ay (int - 2)". (2.110) 
n>0 


Comtet [60] uses this identity, together with (2.100) and (2.113), to derive a 
complicated formula due to Schloémilch for the Stirling cycle numbers. We 
include it here without proof: 


B-Boy ieay 


m=0 
(2.111) 
a yk n—-l+m 2Qn—k m)\ jr—k+m 
ae k-1 n—-k—-—m/\j mi 
(2.112) 


10. Use an argument similar to that of Exercise 9 to prove that 
n) a” 1 k 
— = —(e*-1 2.113 
2G {i} n! k! ( ) ( ) 
n>=0 


for every k > 0. 
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2.8.5 Eulerian Numbers 


3 (Al Hamilton), 7 (Paul Coffey), 11 (Mark Messier), 17 (Jari 
Kurri), 31 (Grant Fuhr), 99 (Wayne Gretzky). 
Retired jersey numbers, Edmonton Oilers 


Suppose that a pipe organ having n pipes needs to be installed at a concert hall. 
Each pipe has a different length, and the pipes must be arranged in a single row. 
Let us say that two adjacent pipes in an arrangement form an ascent if the one on 
the left is shorter than the one on the right, and a descent otherwise. Arranging 
the pipes from shortest to tallest yields an arrangement with n — 1 ascents and no 
descents; arranging them from tallest to shortest results in no ascents and n — 1 
descents. 

Whether for aesthetic or acoustical reasons, the eccentric director of the concert 
hall demands that there be exactly & ascents in the arrangement of the n pipes. 
How many ways are there to install the organ? The answer is the Eulerian number 
eee Stated in more abstract terms, (R) is the number of permutations 7 of the 
integers {1,...,} having 7(z) < a(¢ + 1) for exactly & numbers i between 1 
andn — 1. 

We list a few properties of these numbers. It is easy to see that there is only one 
arrangement of n pipes with no ascents, and only one with n — 1 ascents, so 


(3) ie. yoy (2.114) 


and 


n-1 


( = y=1, n>1. (2.115) 


The Eulerian numbers have a symmetry property similar to that of the binomial 
coefficients. An arrangement of n pipes with k ascents has n — 1 — k descents, so 
reversing this arrangement yields a complementary configuration with n — 1—k 
ascents and k descents. Thus, 


mr nm 
Cha ou) (2.116) 


Next, by summing over k we count every possible arrangement of pipes precisely 


once, so 
n 
s (?) =n. (2.117) 


k 
We also note the degenerate cases 


(7) =o, ifn >0, andk <Oork>n, (2.118) 


and 


(?) =0, ifk 40. (2.119) 
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We can derive a recurrence relation for the Eulerian numbers. To arrange n 
pipes with exactly k ascents, suppose we first place every pipe except the tallest 
into a configuration with exactly / ascents. Then the tallest pipe can be inserted 
either in the first position, or between two pipes forming any ascent. Any other 
position would yield an additional ascent. There are therefore k + 1 different 
places to insert the tallest pipe in this case. Alternatively, we can line up the n — 1 
shorter pipes so that there are / — 1 ascents, then insert the last pipe either at the 
end of the row, or between two pipes forming any descent. There are n — 2 — (k — 
1) = n—k-—1 descents, so there are n — k different places to insert the tallest 
pipe in this case. It is impossible to create a permissible configuration by inserting 
the tallest pipe into any other arrangement of the n — 1 shorter pipes, so 


(aan y+, n>1. (2.120) 


For example, C) = 2(7) + 2;) = 4, and ‘ea = aS) + a) =34+8=11. 
Figure 2.20 shows these eleven arrangements of four pipes with two ascents. 

We can use the recurrence (2.120) to compute the triangle of Eulerian numbers, 
shown in Table 2.7. 


|A=0 1 2 3 4 5 6 7 nl 
n=0 1 I 
1 i 1 
2 : 4 2 
3 1 4 1 6 
4 1 i ll 1 24 
5 1 26 66 26 1 120 
6 1 57 302 302 b7 1 720 
7 1 120 1191 2416 1191 120 1 5040 
8 1 247 4293 15619 15619 4293 247 1 | 40320 


TABLE 2.7. Eulerian numbers, ee 


Next, we study some generating functions involving the Eulerian numbers. Re- 
call that in Section 2.6.5 we computed the generating function for the sequence 
{0,1,2,3,...} by differentiating both sides of the identity }7,..., ck = x, then 
multiplying by z: 


d d 1 x 
So kat So at yaa. 
ae +( . e dx (—) (1 — 2x)?" eee 
k>0 k>0 
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FIGURE 2.20. Four organ pipes with two ascents. 


Clearly, we can obtain a generating function for the sequence of squares {k?} by 
applying the same differentiate-and-multiply operator to (2.121). We find that 


yeaa | wear) 


k>0 
22 1 
="( tao) —o 
_ Bl +2) 
~ (l=2)3° 


In the same way, we may use this operator to calculate the generating function 
for the sequence of cubes, then fourth powers and fifth powers. After a bit of 
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simplifying, we find that 


34 e(1+4¢+4+ 2”) 
ras a oe (2.123) 
k>0 

4% tL lle+ 112? + 23) 
dk Me =, ‘al = x) ’ (2.124) 
k>0 

5k  (l+262+ 66x? + 26a? + 2+) 
Yr Ls = ae (2.125) 
k>0 


A glance at Table 2.7 shows that the coefficients appearing on the right side of 
these formulas are all Eulerian numbers, and we would suspect that the numbers 
ee will appear in the generating function for the sequence of nth powers of inte- 
gers. This is in fact the case. 


Theorem 2.16. [fn > 0 then 


nk - TONS ie 
yeas day =i) ; (2.126) 


k>1 


Proof. We use induction on n. The formula is easy to verify when n = 0, so we 
assume it holds for a nonnegative integer n. We calculate 


yee kp. z( Des 7 
k>1 k>1 


“5 (EG) 
(eh; )(e+ 1) tte (i) 
= =a (c= (Res pat won (,",)s4] 


- = (Deen( pt +o (n+1—k 1." )=") 
rr ae 


The last step follows from the recurrence relation (2.120). O 
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We can use (2.126) to obtain a formula for Cc 
and powers. We calculate 


(ja = GES ma 


) in terms of binomial coefficients 


k m>1 

= So(m41)"2 oe 1)? a9 

m>0 

(2.127) 

_ > He ee ') (m+1)"2it™ 

as 
ye (ME erase! 

k>0 j=0 


Now the first and last expressions in (2.127) are power series in x, sO we can 
equate coefficients to obtain a formula for the Eulerian number eu +e We find that 


= (1) ie ‘e+ Ly; (2.128) 


j=0 


Last, we derive one more interesting identity involving Eulerian numbers, bi- 
nomial coefficients, and ordinary powers. Consider a sort of generating function 
for the sequence { es )} with n fixed, where we use the binomial coefficient ae) 


in place of x”. Let 
n\ (a+k 
k 


so that Fo(x) = 1. For n > 1, we calculate 


ri (Coal seal) C29 
; ( 
( 
7 


ea) es) 
en) teem OS) 


arias 


dk 
Dee 


Combining the two sums on the right, and replacing the term 
(7**) + (214), we find that 


re S(%"N(o(C2") etn) 


by the sum 
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Therefore, Fy (x) = @-, SO we obtain 
k k nm ‘ ; : 


This is known as Worpitzky’s identity [287]. Thus, Eulerian numbers allow us 
to write ordinary powers as linear combinations of certain generalized binomial 
coefficients. For example, «4 = [) + ae) + i) + aoa 


Exercises 


1. Use an ordinary generating function to find a simple formula for ys and 
verify your formula using (2.128). 


2. Let E,,(a) denote the polynomial 


Use (2.126) to show that the exponential generating function for the se- 
quence of polynomials {E,,(2)}n>0o is 


1-2 


That is, show that 


B(2,t) = yo 


n! 
n>0 
3. (From [282].) Use (2.126) and Exercise 11 of Section 2.8.3 to prove that 
\ ok na) 
eth 
k k 
for any nonnegative integer n. 
4. Use (2.128) to establish the following identity for n > 1: 


s(-u) (") +1)" =0. 


j=0 
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5. A neurotic running back for an American football team will run between 
two offensive linemen only if the jersey number of the player on the left is 
less than the jersey number of the player on the right. The player will not run 
outside the last player on either end of the offensive line. The coach wants 
to be sure that the running back has at least three options on every play. 
If the coach always puts seven players on the offensive line, and there are 
fifteen players on the team capable of playing any position on the offensive 
line, each of whom has a different jersey number, how many formations of 
linemen are possible? 


2.9 Stable Marriage 


How do I love thee? Let me count the ways. 
— Elizabeth Barrett Browning, Sonnet 43, 
Sonnets from the Portuguese 


Most of the problems we have considered in this chapter are questions in enumer- 
ative combinatorics, concerned with counting arrangements of objects subject to 
various constraints. In this section we consider a very different kind of combina- 
torial problem. 

Suppose we must arrange n marriages between n men and n women. Each 
man supplies us with a list of the women ranked according to his preference; each 
woman does the same for the men. Is there always a way to arrange the matriages 
so that no unmatched man and woman prefer each other to their assigned spouses? 
Such a pairing is called a stable matching. 

Consider a simple example with n = 2. Suppose Aaron prefers Yvonne over 
Zoé, and Bjorn prefers Zoé over Yvonne. We denote these preferences by 


A:Y >Z, 
B:Z>Y. 


Suppose also that Yvonne and Zoé both prefer Aaron over Bjorn, so 


Y:A>B, 
Z:A>B. 


Then the matching of Aaron with Zoé and Bjérn with Yvonne is unstable, since 
Aaron and Yvonne prefer each other over their partners. The preferences of Bjérn 
and Zoé are irrelevant: Indeed, Zoé would prefer to remain with Aaron in this 
case. On the other hand, the matching of Aaron with Yvonne and Bjorn with Zoé 
is stable, for no unmatched pair prefers to be together over their assigned partners. 

The stable marriage problem is a question of existential combinatorics, since it 
asks whether a particular kind of arrangement exists. We might also consider it as 
a problem in constructive combinatorics, if we ask for an efficient algorithm for 
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finding a stable matching whenever one does exist. In fact, we develop just such 
an algorithm in Section 2.9.1. 

The stable marriage problem and its variations have many applications in prob- 
lems involving scheduling and assignments. We mention three examples. 


1. Stable Roommates. 


Suppose 2n students at a university must be paired off and assigned to n 
dorm rooms. Each student ranks all of the others in order of preference. 
A pairing is stable if no two unmatched students prefer to room with each 
other over their assigned partners. Must a stable pairing always exist? This 
variation of the stable marriage problem, known as stable roommates, is 
considered in Exercise 1. 


2. College Admissions. 


Suppose a number of students apply for admission to a number of univer- 
sities. Each student ranks the universities, and each university ranks the 
students. Is there a way to assign the students to universities in such a way 
that no student and university prefer each other over their assignment? This 
problem is similar to the original stable marriage question, since we are 
matching elements from two sets using information on preferences. How- 
ever, there are some significant differences—probably not every student 
applies to every university, and each university needs to admit a number of 
students. Some variations on the stable marriage problem that cover exten- 
sions like these are considered in Section 2.9.2. 


3. Hospitals and Residents. 


The problem of assigning medical students to hospitals for residencies is 
similar to the problem of matching students and universities: Each medi- 
cal student ranks hospital residency programs in order of preference, and 
each hospital ranks the candidates. In this case, however, a program has 
been used to make most of the assignments in the U.S. since 1952. The Na- 
tional Resident Matching Program was developed by a group of hospitals 
to try to ensure a fair method of hiring residents. Since medical students are 
not obligated to accept the position produced by the matching program, it 
is important that the algorithm produce a stable matching. (Since the pro- 
gram’s inception, a large majority of the medical students have accepted 
their offer.) We describe this matching algorithm in the next section. 


Exercises 


1. Suppose that four fraternity brothers, Austin, Bryan, Conroe, and Dallas, 
need to pair off as roommates. Each of the four brothers ranks the other 
three brothers in order of preference. Prove that there is a set of rankings 
for which no stable matching of roommates exists. 
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2. Suppose Mj, and Mz are two stable matchings between n men and n women, 
and we allow each woman to choose between the man she is paired with in 
My, and the partner she receives in M2. Each woman always chooses the 
man she prefers. Show that the result is a stable matching between the men 
and the women. 


3. Suppose that in the previous problem we assign each woman the man she 
likes less between her partners in the two matchings M/, and M2. Show that 
the result is again a stable matching. 


4. The following preference lists for four men, { A, B, C, D}, and four women, 
{W, X,Y, Z}, admit exactly ten different stable matchings. 


3 
B 
A 
D 
C 
Y 


(a) Prove that the matching {(A, X), (B, Z),(C,W), (D, 


(b) Determine the remaining nine stable matchings. 


)} is stable. 


2.9.1 The Gale—-Shapley Algorithm 


Matchmaker, matchmaker, make me a match! 
— Chava and Hodel, Fiddler on the Roof 


In 1962, Gale and Shapley [117] proved that a stable matching between n men 
and n women always exists by describing an algorithm for constructing such a 
matching. Their algorithm is essentially the same as the one used by the hospitals 
to select residents, although apparently no one realized this for several years [143, 
chap. 1]. 

In the algorithm, we first choose either the men or the women to be the pro- 
posers. Suppose we select the men; the women will have their chance soon. Then 
the men take turns proposing to the women, and the women weigh the offers 
that they receive. More precisely, the Gale—Shapley algorithm has three principal 
steps. 


Algorithm 2.17 (Gale-Shapley). Construct a stable matching. 


Input. A set of m men, a set of 2 women, a ranked list of the m women for 
each man, and a ranked list of the nm men for each woman. 


Output. A stable matching that pairs the n men and n women. 


Description. 


Step 1. Label every man and woman as free. 
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Step 2. While some man m is free, do the following. 


Let w be the highest-ranked woman on the preference list of m to 
whom m has not yet proposed. If w is free, then label m and w as 
engaged to each other. If w is engaged to m’ and w prefers m over m’, 
then label ™m’ as free and label m and w as engaged to one another. 
Otherwise, if w prefers m’ over m, then w remains engaged to m’ and 
m remains free. 


Step 3. Match all of the engaged couples. 


For example, consider the problem of arranging marriages between five men, 
Mack, Mark, Marv, Milt, and Mort, and five women, Walda, Wanda, Wendy, 
Wilma, and Winny. The men’s and women’s preferences are listed in Table 2.8. 


1 


2 


3 


4 


5 


Mack 
Mark 
Marv 
Milt 
Mort 


Walda 
Wanda 
Wendy 
Wilma 
Winny 


Winny 
Wanda 
Winny 
Winny 
Wanda 


Milt 
Milt 
Mort 
Mark 
Marv 


Wilma 
Winny 
Walda 
Wilma 
Winny 


Mort 
Marv 
Mack 
Mort 
Mort 


Wanda 
Wendy 
Wanda 
Wanda 
Walda 


Mack 
Mort 
Milt 
Milt 
Mark 


Walda 

Wilma 
Wilma 
Wendy 
Wilma 


Mark 
Mark 
Mark 
Mack 
Milt 


Wendy 
Walda 
Wendy 
Walda 
Wendy 


Marv 
Mack 
Marv 
Marv 
Mack 


TABLE 2.8. Preferences for five men and women. 


First, Mack proposes to Winny, who accepts, and Mark proposes to Wanda, 
who also accepts. Then Marv proposes to Winny. Winny likes Marv much better 
than her current fiancé, Mack, so Winny rejects Mack and becomes engaged to 
Marv. This leaves Mack without a partner, so he proceeds to the second name 
on his list, Wilma. Wilma currently has no partner, so she accepts. Our engaged 


couples are now 


(Mack, Wilma), (Mark, Wanda), and (Marv, Winny). 


Next, Milt proposes to his first choice, Winny. Winny prefers her current partner, 
Mary, so she rejects Milt. Milt proceeds to his second choice, Wilma. Wilma re- 
jects Mack in favor of Milt, and Mack proposes to his third choice, Wanda. Wanda 
prefers to remain with Mark, so Mack asks Walda, who accepts. Our engaged cou- 


ples are now 


(Mack, Walda), (Mark, Wanda), (Marv, Winny), and (Milt, Wilma). 
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Now our last unmatched man, Mort, asks his first choice, Wanda. Wanda accepts 
Mort over Mark, then Mark asks his second choice, Winny. Winny rejects Mark in 
favor of her current partner, Marv, so Mark proposes to his third choice, Wendy. 
Wendy is not engaged, so she accepts. Now all the men and women are engaged, 
so we have our matching: 


(Mack, Walda), (Mark, Wendy), (Marv, Winny), 
(Milt, Wilma), and (Mort, Wanda). 


We prove that this is in fact a stable matching. 


Theorem 2.18. The Gale—Shapley algorithm produces a stable matching. 


Proof. First, each man proposes at most n times, so the procedure must termi- 
nate after at most n? proposals. Thus, the procedure is an algorithm. Second, the 
algorithm always produces a matching. This follows from the observations that a 
woman, once engaged, is thereafter engaged to exactly one man, and every man 
ranks every woman, so the last unmatched man must eventually propose to the 
last unmatched woman. Third, we prove that the matching is stable. Suppose m 
prefers w to his partner in the matching. Then m proposed to w, and was rejected 
in favor of another suitor. This suitor is ranked higher than m by w, so w must 
prefer her partner in the matching to m. Therefore, the matching is stable. O 


We remark that the Gale-Shapley algorithm is quite efficient: A stable match- 
ing is always found after at most n? proposals. (Exercise 8 establishes a better 
upper bound.) 

Suppose that we choose the women as the proposers. Does the algorithm pro- 
duce the same stable matching? We test this by using the lists of preferences in 
Table 2.8. First, Walda proposes to Milt, who accepts. Next, Wanda proposes to 
Milt, and Milt prefers Wanda over Walda, so he accepts. Walda must ask her sec- 
ond choice, Mort, who accepts. Then Wendy proposes to Mort, who declines, so 
she asks Mack, and Mack accepts. Last, Wilma asks Mark, and Winny proposes 
to Marv, and both accept. We therefore obtain a different stable matching: 


(Walda, Mort), (Wanda, Milt), (Wendy, Mack), 
(Wilma, Mark), and (Winny, Marv). 


Only Winny and Marv are paired together in both matchings; everyone else re- 
ceives a higher-ranked partner precisely when he or she is among the proposers. 
Table 2.9 illustrates this for the two different matchings. The pairing obtained 
with the men as proposers is in boldface; the matching resulting from the women 
as proposers is underlined. 

The next theorem shows that this is no accident. The proposers always obtain 
the best possible stable matching, and those in the other group, which we call 
the proposees, always receive the worst possible stable matching. We define two 
terms before stating this theorem. We say a stable matching is optimal for a person 
p if p can do no better in any stable matching. Thus, if p is matched with g in an 
optimal matching for p, and p prefers r over q, then there is no stable matching 
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1 2 3 4 5 


Mack | Winny Wilma Wanda Walda Wendy 


Mark Wanda Winny Wendy Wilma Walda 
Marv Winny Walda Wanda Wilma Wendy 
Milt Winny Wilma Wanda Wendy Walda 
Mort Wanda Winny Walda Wilma Wendy 


Walda | Milt Mort Mack Mark Marv 
Wanda | Milt Marv Mort Mark Mack 
Wendy | Mort Mack Milt Mark Marv 
Wilma | Mark Mort Milt Mack Marv 
Winny | Marv Mort Mark Milt Mack 


TABLE 2.9. Two stable matchings. 


where p is paired with r. Similarly, a stable matching is pessimal for p if p can do 
no worse in any stable matching. So if p is matched with qg in a pessimal matching 
for p, and p prefers q over r, then there is no stable matching where p is paired 
with r. Finally, a stable matching is optimal for a set of people P if it is optimal 
for every person p in P, and likewise for a pessimal matching. 


Theorem 2.19. The stable matching produced by the Gale—Shapley algorithm is 
independent of the order of proposers, optimal for the proposers, and pessimal 
for the proposees. 


Proof. Suppose the men are the proposers. We first prove that the matching pro- 
duced by the Gale—-Shapley algorithm is optimal for the men, regardless of the 
order of the proposers. Order the men in an arbitrary manner, and suppose that a 
man m and woman w are matched by the algorithm. Suppose also that m prefers 
a woman w’ over w, denoted by m : w’ > w, and assume that there exists a 
stable matching M with m paired with w’. Then m was rejected by w’ at some 
time during the execution of the algorithm. We may assume that this was the first 
time a potentially stable couple was rejected by the algorithm. Say w’ rejected m 
in favor of another man mm’, so w’ : m’ > m. Then m’ has no stable partner he 
prefers over w’, by our assumption. Let w” be the partner of m’ in the matching 
M. Then w” 4 w’, since m is matched with w’ in M, and som’ : w’ > w”. But 
then m’ and w’ prefer each other to their partners in M, and this contradicts the 
stability of MM. 

The optimality of the matching for the proposers is independent of the order of 
the proposers, so the first statement in the theorem follows immediately. 

Finally, we show that the algorithm is pessimal for the proposees. Suppose 
again that the men are the proposers. Assume that ™m and w are matched by the 
algorithm, and that there exists a stable matching / where w is matched with a 
man m’/ and w : m > m’. Let w’ be the partner of m in M. Since the Gale— 
Shapley algorithm produces a matching that is optimal for the men, we have m : 
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w > w’. Therefore, m and w prefer each other over their partners in M, and this 
contradicts the stability of 17. O 


Exercises 


1. Our four fraternity brothers, Austin, Bryan, Conroe, and Dallas, plan to 
ask four women from the neighboring sorority, Willa, Xena, Yvette, and 
Zelda, to a dance on Friday night. Each person’s preferences are listed in 
the following table. 


Austin | Yvette Xena Zelda Willa 
Bryan | Willa Yvette Xena Zelda 
Conroe | Yvette Xena Zelda Willa 
Dallas | Willa Zelda Yvette Xena 


Willa Austin Dallas Conroe’ Bryan 
Xena Dallas Bryan Austin Conroe 
Yvette | Dallas Bryan Conroe Austin 
Zelda Austin Dallas Conroe’ Bryan 


(a) What couples attend the dance, if each man asks the women in his 
order of preference, and each woman accepts the best offer she re- 
ceives? 


(b) Suppose the sorority hosts a “Sadie Hawkins” dance the following 
weekend, where the women ask the men out. Which couples attend 
this dance? 


2. Determine the total number of stable matchings that pair the four men Axel, 
Buzz, Clay, and Drew with the four women Willow, Xuxa, Yetty, and Zizi, 
given the following preference lists. 


Axel Yetty Willow Zizi Xuxa 
Buzz Yetty Xuxa Zizi Willow 
Clay Zizi Yetty Xuxa Willow 
Drew Xuxa = Zizi Willow Yetty 


Willow | Buzz Drew Axel Clay 
Xuxa Buzz Axel Clay Drew 
Yetty Drew Clay Axel Buzz 
Zizi Axel Drew Buzz Clay 


3. Determine a list of preferences for four men and four women where no one 
obtains his or her first choice, regardless of who proposes. 
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4. Determine a list of preferences for four men and four women where one 
proposer receives his or her lowest-ranked choice. 


5. Determine a list of preferences for four men and four women where one 
proposer receives his or her lowest-ranked choice, and the rest of the pro- 
posers receive their penultimate choice. 


6. Suppose that all the men have identical preference lists in an instance of the 
stable marriage problem. Show that there exists exactly one stable matching 
by completing the following argument. Let M be the matching obtained 
by the Gale-Shapley algorithm using the men as proposers, and suppose 
another stable matching 1’ exists. Among all women who change partners 
between MM and M’, let w be the woman who ranks lowest on the men’s 
common preference list. Suppose m and w are matched in M/, and m and 
w’ in M’. Determine a contradiction. 


7. Suppose that the preference lists of the men mj, ..., m, and the women 
W1,.++, Wn have the property that m; ranks w; ahead of each of the women 
Wi+1, +++) Wn, and w; ranks m; ahead of each of the men m;41, ..., Mn, 
for each 7. 


(a) Show that the matching (m1, w1), ..., (™n, Wn) is stable. 


(b) (Eeckhout [86].) Show that this is the unique stable matching in this 
case. 


(c) Prove that there are (n!)”~' different sets of preference lists for m1, 
..., My, that have the property that m,; ranks w; ahead of each of the 
women W;+1,-.-, Wn, for each 7. 


(d) Prove that at least 1/n! of the possible instances of the stable marriage 
problem for n couples admits a unique solution. 


8. (Knuth [178].) Prove that the Gale-Shapley algorithm terminates after at 
most n? — n + 1 proposals by showing that at most one proposer receives 
his or her lowest-ranked choice. 


9. Suppose that more than one woman receives her lowest-ranked choice when 
the men propose. Prove that there exist at least two stable matchings be- 
tween the men and the women. 


2.9.2 Variations on Stable Marriage 


I want what any princess wants—to live happily ever after, with the 
ogre I married. 
— Princess Fiona, Shrek 2 


The stable marriage problem solves matching problems of a rather special sort. 
Each member of one set must rank all the members of the other set, and the two 
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sets must have the same number of elements. In this section, we consider several 
variations of the stable marriage problem, in order to apply this theory much more 
broadly. In each case, we study two main questions. First, how does the change 
affect the existence and structure of the stable pairings? Second, can we amend 
the Gale-Shapley algorithm to construct a stable matching in the new setting? 


Unacceptable Partners 


Suppose each of n men and n women ranks only a subset of their potential mates. 
Potential partners omitted from a person’s list are deemed unacceptable to that 
person, and we do not allow any pairing in which either party is unacceptable 
to the other. Clearly, we cannot in general guarantee even a complete matching, 
since for instance a confirmed bachelor could mark all women as unacceptable. 
This suggests a modification of our notion of a stable matching for this problem. 
We say a matching (or partial matching) M is unstable if there exists a man m and 
woman w who are unmatched in /, each of whom is acceptable to the other, and 
each is either single in MM, or prefers the other to their partner in 17. We will show 
that every such problem admits a matching that is stable in this sense, and further 
that every stable matching pairs the same subcollection of men and women. We 
first require a preliminary observation. We say a person p prefers a matching MM, 
over a matching Mo if p strictly prefers his or her partner in Mj, to p’s match in 
Mp. 


Lemma 2.20. Suppose M, and Mg are stable matchings of n men and n women, 

whose preference lists may include unacceptable partners. If m and w are matched 
in My, but not in Mo, then one of m or w prefers M, over Mo, and the other 
prefers Mz over My. 


Proof. Suppose mo and wy are paired in MM, but not A/2. Then mo and wo cannot 
both prefer M1, since otherwise M2 would not be stable. Suppose that both prefer 
Mg. Then both have partners in M3, so suppose (mo, w1) and (m1, wo) are in Mo. 
Both mo and w, cannot prefer M2, since M, is stable, so w, must prefer /, and 
likewise m, must prefer 1/7). These two cannot be paired in (1, so denote their 
partners in Ad, by mg and wz. By the same reasoning, both of these people must 
prefer M2, but cannot be matched together in M2, so we obtain m3 and w3, who 
prefer 14;, but are not paired to each other in (4;. We can continue this process 
indefinitely, obtaining a sequence mo, Wo, M2, W2, M4, Wa, ... Of distinct men 
and women who prefer Mz over Mj, and another sequence m1, w1, 3, W3,... 
of different people who prefer MM, over Mz. This is impossible, since there are 
only finitely many men and women. O 


We can now establish an important property of stable matchings when some un- 
acceptable partners may be included: For a given set of preferences, every stable 
matching leaves the same group of men and women single. 


Theorem 2.21. Suppose each of n women ranks a subset of n men as potential 
partners, with the remaining men deemed unacceptable, and suppose each of the 
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men rank the women in the same way. Then there exists a subset Xo of the women 
and a subset Yo of the men such that every stable matching of the n men and n 
women leaves precisely the members of Xo and Yo unassigned. 


Proof. Suppose My and Mp are distinct stable matchings, and suppose ™j is 
matched in MM, but not in Mz. Let w 1 be the partner of m; in Mj. Since m4 
clearly prefers M1 over M2, by Lemma 2.20 w, must prefer Mz over Mj. Let 
mz be the partner of w; in Mz. Then m2 prefers Mj, and so his partner wz in 
M, must prefer M2 over M,. Continuing in this way, we obtain an infinite se- 
quence (m1, w1), (m2, we), (m3, ws), ... of distinct couples in 1, (and another 
sequence (m2, w1), (m3, We), (ma, wz), ... in M2), which is impossible. O 


We still need to show that at least one stable matching exists, and we can do 
this by altering the Gale-Shapley algorithm for preference lists that may include 
unacceptable partners. We require just two modifications. First, we terminate the 
loop either when all proposers are engaged, or when no free proposer has any 
remaining acceptable partners to ask. Second, proposals from unacceptable part- 
ners are always rejected. It is straightforward to show that this amended procedure 
always produces a stable matching (see Exercise 1). We can illustrate it with an 
example. Suppose the four men Iago, Julius, Kent, and Laertes each rank a subset 
of the four women Silvia, Thaisa, Ursula, and Viola, and each of the women ranks 
a subset of the men, as shown in Figure 2.21. Potential partners omitted from a 
person’s list are deemed unacceptable to that person, so for example Iago would 
not consider marrying Thaisa or Ursula. 


FIGURE 2.21. Preferences with unacceptable partners. 


Suppose the men propose. Iago first asks Viola, but she rejects him as an unac- 
ceptable partner, so he asks Silvia, who happily accepts. Next, Julius asks Silvia, 
who rejects him in favor of Iago, so he proposes to Viola, who now accepts. Ur- 
sula then rejects Kent, then Thaisa accepts his proposal. Finally, Laertes proposes 
to Silvia, then Thaisa, then Viola, but each rejects him. Our stable matching is 
then (Iago, Silvia), (Julius, Viola), and (Kent, Thaisa). The set Xo of unmatch- 
able bachelorettes contains only Ursula, and Yo = {Laertes}. 

We have shown how to adapt the Gale-Shapley algorithm to handle incomplete 
preference lists, but we can also describe a way to alter the data in such a way that 
we can apply the Gale-Shapley algorithm without any modifications. To do this, 
we introduce a fictitious man to mark the boundary between the acceptable and 
unacceptable partners on each woman’s list, and similarly introduce a fictitious 
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woman for the men’s lists. We’ll call our invented man the ogre, and our fictitious 
woman, the ogress. Append the ogre to each woman’s ranked list of acceptable 
partners, then add her unacceptable partners afterwards in an arbitrary order. Thus, 
each woman would sooner marry an ogre than one of her unacceptable partners. 
Do the same for the men with the ogress. The ogre prefers any woman over the 
ogress, and the ogress prefers any man over the ogre (people are tastier!), but 
the rankings of the humans on the ogre’s and ogress’ lists are immaterial. For 
example, we can augment the preference lists of Figure 2.21 to obtain the 5 x 5 
system of Figure 2.22, using M to denote the ogre and W for the ogress. 


FIGURE 2.22. Augmented preference lists. 


We can now characterize when the original configuration has a complete stable 
matching, that is, a stable pairing where no one is left single. 


Theorem 2.22. Suppose each of n men ranks some subset of n women as accept- 
able partners, and each of the women does the same for the men. Suppose further 
that we obtain an instance of the standard stable marriage problem on n+ 1 men 
and women by adding an ogre M and ogress W, and augmenting the preference 
lists in the manner described above. Then the original system has a complete sta- 
ble matching if and only if the augmented system has a stable matching where M 
is paired with W. 


Proof. Suppose the original system has a complete stable matching. Then each 
woman prefers her partner in this matching to the ogre under the augmented 
preferences, and likewise no man would leave his partner for the ogress. Thus, 
adding (M, W) to this pairing produces a stable matching for the augmented sys- 
tem. Next, suppose the augmented system has a stable matching P’ that includes 
(M, W), and let P = P’ \ {(M, W)}. Suppose (m, w) € P. If m is unaccept- 
able to w, then w would prefer the ogre M over m, and certainly M prefers w 
over W. This contradicts the stability of P’. Similarly, w must be acceptable to 
m. Thus, P is a complete matching of mutually acceptable partners, and stability 
follows at once from the stability of P’. O 


Exercise 2 asks you to show that M and W must be paired together in all 
stable matchings of the augmented system, if they are paired in any particular 
stable matching. Thus, we can determine if a complete stable matching exists by 
running the original Gale-Shapley algorithm on the augmented preference lists, 
choosing either set as the proposers. 
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While applying the Gale-Shapley algorithm in this way always produces a 
matching that is stable with respect to the augmented preferences, it is impor- 
tant to note that restricting such a pairing back to the original preferences might 
not produce a stable matching! For example, when the men propose using the 
augmented lists of Figure 2.22, we obtain the stable matching 


(Iago, Silvia), (Julius, Viola), (Kent, Ursula), (2.130) 
(Laertes, Ogress), (Ogre, Thaisa). . 
However, Kent is not acceptable to Ursula, so we must disband this pair when we 
restrict to the original preference lists. The surviving pairs are (Iago, Silvia) and 
(Julius, Viola), and now Kent and Thaisa are unmatched but mutually acceptable. 


Indifference 


In the original stable marriage problem, we required that all preferences be strictly 
ordered, since each person needed to assign each potential partner a different rank. 
However, rankings often contain items that are valued equally. What happens if 
we allow weakly ordered rankings, that is, rankings that may contain some ele- 
ments of the same rank? Suppose that each of n men supplies a weak ordering of a 
set of 2 women, and each of the women does the same for the men. We’ll assume 
for now that all rankings are complete, so there are no unacceptable partners. Must 
a stable ranking exist? Can we construct one? 

We first require a clarification of our notion of stability for this situation. We 
say amatching MV of the men and women is unstable if there exists an unmatched 
couple m and w, each of whom strictly prefers the other to his or her partner in 
M. For example, if m strictly prefers w to his partner, but w ranks m equal to her 
partner, then the pair m and w do not violate stability under this definition. 

One can certainly study this problem with other notions of stability. For in- 
stance, one could demand that no unmatched man and woman weakly prefer each 
other to their assigned partners. A matching with no such couples is called super- 
stable. Or one could require that no unmatched couple prefer each other, one in a 
strict sense and the other in a weak manner. Such a matching is said to be strongly 
stable. Since the notion that we employ is the least restrictive, matchings with this 
property are often called weakly stable. 

Given a collection of weakly ordered preference lists for n men and n women, 
we can certainly create a corresponding set of strongly ordered preference lists by 
breaking each tie in an arbitrary way. We call the strongly ordered preferences a 
refinement of the original weak preferences. A stable matching for the refined lists 
certainly exists, and it is easy to see that this matching is also a (weakly) stable 
matching for the original, weakly ordered lists. Furthermore, every stable match- 
ing for the original preferences can be obtained in this way. We can summarize 
these facts in the following theorem. 


Theorem 2.23. Suppose each of n men ranks a collection of n women, with tied 
rankings allowed, and each woman does the same for the men. Then a stable 
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matching for these preferences exists, and further every such stable matching is a 
stable matching for some refinement of these weakly ordered rankings. 


Proof. For the first part, let P’ be a refinement of the given list of preferences 
P, and let M be a stable matching for P’. If m and w are unmatched in M, and 
according to P strictly prefer each other to their partners in this matching, then 
they also strictly prefer each other according to P’. This is impossible, since M 
is stable with respect to P’. Thus, M is stable with respect to P. 

For the second part, suppose MV is a stable matching with respect to P. We 
need to construct a refinement P’ of P where I is stable. If (m,w) € M, and 
m ranks w’ equal to w in P, then let m rank w ahead of w’ in P’. Likewise, if 
w ranks m’ equal to m in P, then w ranks m ahead of m’ in P’. Any remaining 
tied rankings are broken arbitrarily to complete P’. Suppose then that mo and wo 
are unmatched in P’, but prefer each other (according to P’) to their partners in 
M. Since M is stable with respect to P, then either mo ranks wo equal to his 
partner in M, or wo ranks mo equal to her partner in / (or both). We obtain a 


contradiction in either case, by the construction of P’. O 
G:D>A=C>oF A: I>G=H=K 
H:A=F>C=D C:H>G=I>K 
I: F>C>D>S>A D:I>K>H>G 
K:D>A=C=FfF F:H=I>G=Kk 


FIGURE 2.23. Preference lists with indifference. 


The Gale-Shapley algorithm requires no modifications for this variation, once 
a refinement is selected. Of course, the algorithm may produce different match- 
ings for different refinements, even when the same group proposes. For example, 
suppose the four men Gatsby, Hawkeye, Ishmael, and Kino, and four women 
Apolonia, Cora, Daisy, and Fayaway, submit the preference lists shown in Fig- 
ure 2.23. Using the refinement obtained by replacing each = in these lists with 
>, the Gale-Shapley algorithm produces the following matching when the men 
propose: 
(Gatsby, Apolonia), (Hawkeye, Fayaway), 


(Ishmael, Cora), (Kino, Daisy). (2.131) 


However, if we reverse the order of Apolonia and Cora in the refinement of 
Gatsby’s list, and the order of Apolonia and Fayaway in Hawkeye’s, we then 
obtain a very different stable matching: 


(Gatsby, Cora), (Hawkeye, Fayaway), 


(Ishmael, Daisy), (Kino, Apolonia). (2.132) 


Finally, we may also ask about combining this extension of the stable marriage 
problem with the prior one. Suppose the men and women supply weakly ordered 
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rankings, and may also declare some potential partners as unacceptable. The sta- 
ble matching problem becomes much more complicated in this case. Even the size 
of a stable matching may vary, in contrast to the case of unacceptable partners with 
strict rankings, where Theorem 2.21 guarantees that all stable matchings have not 
only the same size, but match exactly the same men and women. For example, 
consider the following 2 x 2 system from [196], where A finds Y acceptable but 
not Z, and Z finds B acceptable but not A. 


A:Y Y:A=B 
B:Y>Z Z:B 


These preferences admit exactly two stable matchings, which have different sizes: 
{(A,Y),(B, Z)} and {(B, Y)}. 

We might ask if we could determine a stable matching of maximal size in a 
problem like this, since this would often be desirable. However, no fast algorithm 
is known for computing this in the general n x n case. (Here, a “fast” algorithm 
would have its running time bounded by a polynomial in n.) In fact, it is known 
[196] that this problem belongs to a family of difficult questions known as NP- 
complete problems. The problem remains hard even if ties are allowed in only the 
men’s or only the women’s preferences, and all ties occur at the end of each list, 
even if each person is allowed at most one tied ranking. 


Sets of Different Sizes 


Every stable marriage problem we have considered so far required an equal num- 
ber of men and women. Suppose now that one group is larger than the other. Of 
course, we could not possibly match everyone with a partner now, but can we find 
a stable matching that pairs everyone in the smaller set? Here, we say a matching 
(or partial matching) / is unstable if there exists a man m and woman w, un- 
matched in M, such that each is either single in M, or prefers the other to his or 
her partner in /. 

We can solve this variation by considering it to be a special case of the problem 
with unacceptable partners. Suppose we have k men and n women, with n > k. 
Suppose also that each of the men rank each of the women in strict order, and each 
of the women reciprocate for the men. We introduce n—k ghosts to the set of men. 
Each ghost finds no woman to be an acceptable partner, and each women would 
not accept any ghost. Then a stable matching exists by the modified Gale-Shapley 
algorithm for unacceptable partners, and by Theorem 2.21 there exists a set Xo 
of women and Yo of ghosts and men such that the members of Xo and Yo are 
precisely the unassigned parties in any stable matching. Certainly Yo includes all 
the ghosts, since they have no acceptable partners. But no man can be unassigned 
in a stable matching, since each man is acceptable to all the women. Thus, Xo is 
empty and Yo is precisely the set of ghosts, and we obtain the following theorem. 


Theorem 2.24. Suppose each of k men ranks each of n women in a Strict order- 
ing, and each of the women ranks the men in the same way. Then 


(i) a stable matching exists, 
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(ii) every stable matching pairs every member of the smaller set, and 


(iii) there exists a subset X of the larger set such that every stable matching 
leaves the members of X unassigned, and the others all matched. 


An example with groups of different sizes appears in Exercise 6. Some other in- 
teresting variations (and combinations of variations) on the stable marriage prob- 
lem are introduced in the exercises too. We will study marriage problems further 
in Chapter 3, where in Section 3.8 we investigate matchings for various infinite 


Sets. 


Exercises 


1. Prove that the Gale-Shapley algorithm, amended to handle unacceptable 
partners, always produces a stable matching. 


2. Prove that if the ogre and ogress are paired in some stable matching for an 
augmented system of preferences as in Theorem 2.22, then they must be 
paired in every such stable matching. 


3. (a) Verify the stable matching (2.130) produced by the Gale-Shapley al- 


gorithm when the men propose using the preferences in Figure 2.22. 


(b) Compute the stable matching obtained when the women propose us- 


wm 


ing these preferences. Does this pairing restrict to a stable matching 
for Figure 2.21? 


In the augmentation procedure for the case of unacceptable partners, 
we can list the unacceptable partners for each person in any order after 
the ogre or ogress, and we can list the humans in any order in the lists 
for the ogre and ogress. Show that one can select orderings when aug- 
menting the preferences of Figure 2.21 so that when the men propose 
in the Gale-Shapley algorithm, one obtains a pairing that restricts to a 
stable matching of Figure 2.21. 


4. The following problems all refer to the weakly ordered preference lists of 
Figure 2.23. 


(a) Verify the matching (2.131) obtained from the refinement obtained by 


(b 


(c 


wm 


wm 


replacing each = with >, when the men propose in the Gale-Shapley 
algorithm. Then determine the matching obtained when the women 
propose. 


Verify (2.132) using the refinement obtained from the previous one 
by reversing the order of Apolonia and Cora in Gatsby’s list, and 
Apolonia and Fayaway in Hawkeye’s. Then determine the matching 
obtained when the women propose. 


Construct another refinement by ranking any tied names in reverse 
alphabetical order. Compute the stable matchings constructed by the 
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Gale-Shapley algorithm when the men propose, then when the women 
propose. 


5. Construct three refinements of the following preference lists so that the 
Gale-Shapley algorithm, amended for unacceptable partners, produces a 
stable matching of a different size in each case. 


A:W W:A=B=C=D 
B:W>xX Xi? B=C=D 
C:wWw>x>yYy ¥: GD 
D:W>X>Y>Z 4 :D 


6. Suppose the five men Arceneaux, Boudreaux, Comeaux, Duriaux, and Gaut- 
reaux, each rank the three women Marteaux, Robichaux, and Thibodeaux 
in order of preference, and the women each rank the men, as shown in the 
following tables. 


Determine the stable matching obtained when the men propose, then the 
matching found when the women propose. What is the set X of Theo- 
rem 2.24 for these preferences? 


7. Suppose we allow weakly ordered rankings in the hypothesis of Theo- 
rem 2.24. Determine which of the conclusions still hold, and which do not 
necessarily follow. Supply a proof for any parts that do hold, and supply a 
counterexample for any parts that do not. 


8. Suppose that each of n students, denoted S$, So, ..., S,, ranks each of 
m universities, U;, U2, ..., Um, and each university does the same for the 
students. Suppose also that university U;, has p;, open positions. We say an 
assignment of students to universities is unstable if there exists an unpaired 
student 5; and university U; such that S; is either unassigned, or prefers U; 
to his assignment, and U; either has an unfilled position, or prefers S; to 
some student in the new class. 


(a) Assume that )>;"_, py = n. Explain how to amend the preference lists 
so that the Gale-Shapley algorithm may be used to compute a stable 
assignment of students to universities, with no university exceeding 
its capacity. 
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(b) Repeat this problem without assuming that the number of students 
matches the total number of open positions. 


(c) Suppose each student ranks only a subset of the universities, and each 
university ranks only a subset of the students who apply to that school. 
Assume that unranked possibilities are unacceptable choices. Modify 
the definition of stability for this case, then describe how to use the 
Gale-Shapley algorithm to determine a stable assignment. 


9. Suppose that each of n students, denoted S, S2,..., S,,, needs to enroll in 
a number of courses from among m possible offerings, denoted C), C2,..., 
Cin. Assume that student S; can register for up to g; courses, and course 
C; can admit up to r; students. An enrollment is a set of pairs (S;, Cj) 
where each student 5S; appears in at most qg; such pairs, and each course C;; 
appears in at most 7; pairs. Suppose each student ranks a subset of accept- 
able courses in order of preference, and the supervising professor of each 
course ranks a subset of acceptable students. Define a stable enrollment in 
an appropriate way. 


2.10 Combinatorial Geometry 


We should expose the student to some material that has strong 
intuitive appeal, is currently of research interest to professional 
mathematicians, and in which the student himself may discover 
interesting problems that even the experts are unable to solve. 
— Victor Klee, from the translator’s preface to 
Combinatorial Geometry in the Plane [144] 


The subject of combinatorial geometry studies combinatorial problems regarding 
arrangements of points in space, and the geometric figures obtained from them. 
Such figures include lines and polygons in two dimensions, planes and polyhedra 
in three, and hyperplanes and polytopes in n-dimensional space. This subject has 
much in common with the somewhat broader subject of discrete geometry, which 
treats all sorts of geometric problems on discrete sets of points in Euclidean space, 
especially extremal problems concerning quantities such as distance, direction, 
area, volume, perimeter, intersection counts, and packing density. 

In this section, we provide an introduction to the field of combinatorial geom- 
etry by describing two famous problems regarding points in the plane: a question 
of Sylvester concerning the collection of lines determined by a set of points, and 
a problem of Erdés, Klein, and Szekeres on the existence of certain polygons that 
can be formed from large collections of points in the plane. The latter problem 
leads us again to Ramsey’s theorem, and we prove this statement in a more gen- 
eral form than what we described in Section 1.8. (Ramsey theory is developed 
further in Chapter 3.) In particular, we establish some of the bounds on the Ram- 
sey numbers R(p, q) that were cited in Section 1.8. 
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2.10.1 Sylvester’s Problem 


Thufferin’ thuccotash! 
— Sylvester the cat, Looney Tunes 


James Joseph Sylvester, a British-born mathematician, spent the latter part of his 
career at Johns Hopkins University, where he founded the first research school in 
mathematics in America, and established the first American research journal in 
the subject, The American Journal of Mathematics. Toward the end of his career, 
Sylvester posed the following problem in 1893, in the “Mathematical Questions” 
column of the British journal, Educational Times [265]. 


Sylvester’s Problem. Given n > 3 points in the plane which do not all lie on the 
same line, must there exist a line that passes through exactly two of them? 


Given a collection of points in the plane, we say a line is ordinary if it passes 
through exactly two of the points. Thus, Sylvester’s problem asks if an ordinary 
line always exists, as long as the points are not all on the same line. 

This problem remained unsolved for many years, and seemed to have been 
largely forgotten until Erdés rediscovered it in 1933. Tibor Gallai, a friend of 
Erdés’ who is also known as T. Griinwald, found the first proof in the same year. 
Erdos helped to revive the problem by posing it in the “Problems” section of the 
American Mathematical Monthly in 1933 [89], and Gallai’s solution was pub- 
lished in the solution the following year [264]. 

Kelly also produced a clever solution, which was published in a short article 
by Coxeter in 1948 [62], along with a version of Gallai’s argument. Forty years 
later, the computer scientist Edsger Dijkstra derived a similar proof, but with a 
more algorithmic viewpoint [76]. The proof we present here is based on Dijkstra’s 
algorithm. Given any collection of three or more points which do not all lie on the 
same line, it constructs a line with the required property. 

In this method, we start with an arbitrary line 2; connecting at least two points 
of the set, and some point S; from the set that does not lie on /;. If ; contains just 
two of the points, we are done, so suppose that at least three of the points lie on 
¢,. The main idea of the method is to construct from the current line @; and point 
S; another line @2 and point S2, with Sj not on @2. Then we iterate this process, 
constructing @3 and S3, then @4 and $4, etc., until one is assured of obtaining a line 
that connects exactly two of the points of the original collection. In order to ensure 
that the procedure does not cycle endlessly, we introduce a termination argument: 
a strictly monotone function of the state of the algorithm. A natural candidate is 
the distance d;, from the point S;, to the line @;,, so dz, = d(Sx, €;,). We therefore 
aim to construct ,4 , and S,41 from @, and S$; in such a way that dpi. < dk. 
Since there are only finitely many points, there are only finitely many possible 
values for d;,, so if we can achieve this monotonicity, then it would follow that the 
procedure must terminate. 

We derive a procedure that produces a strictly decreasing sequence {d;,}. Sup- 
pose the line @;, contains the points P,, Q,, and R; from our original collection, 
and Sj, is a point from the set that does not lie on ¢;. We need to choose ¢,4 1 and 
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Sk41 so that dpi1 < dz. Suppose we set 5,41 to be one of the points that we 
labeled on ¢;, say Sp41 = Qz. Certainly Q;, does not lie on either of the lines 
P,,5S;, or RySy, So we might choose one of these two lines for our ¢,41. Can we 
guarantee that one of these choices will produce a good value for d;,1? To test 
this, let 


Pr = d(Qk, PeSk) 
and —— 
Te = A(Qz, ReSk). 
We require then that 
min(pr, Tr) < dk. (2.133) 


FIGURE 2.24. Similar triangles in the construction. 


Using similar triangles in Figure 2.24, we see that the inequality p, < dy is 
equivalent to the statement 


U( Pe, Qk) < d(Pe, Sk), (2.134) 
and likewise r;, < dj, is equivalent to the inequality 
d(Qr, Rr) < d(Sx, Re). (2.135) 
Now at least one of (2.134) or (2.135) must hold if 
U(Pr,Qxn) + d(Qr, Rr) < d( Pr, Sr) + d(Sz, Re). 
Further, since 5;, does not lie on @, by the triangle inequality we know that 
d( Px, Re) < d(Pr, Sk) + d(S%, Re). 
Therefore, inequality (2.133) follows from the statement 
d(Pr,Qr) + d(Qr, Re) < d( Pr, Re). 
However, by the triangle inequality, we know that 


(Pr, Qk) + d(Qr, Re) > d( Pe, Re). 
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Thus, we require that 
d( Py, Qe) +d(Qz, Ry) = d( Py, Re). 


Clearly, this latter condition holds if and only if Q; lies between P;, and R;, on 
¢;,. We therefore obtain the following algorithm for solving Sylvester’s problem. 


Algorithm 2.25. Construct an ordinary line. 
Input. A set of n > 3 points in the plane, not all on the same line. 
Output. A line connecting exactly two of the points. 


Description. 


Step I. Let ¢; be a line connecting at least two of the points in the given set, 
and let S; be a point from the collection that does not lie on ¢;. Set 
k; = 1, then perform Step 2. 


Step 2. If £;, contains exactly two points from the original collection, then out- 
put @;, and stop. Otherwise, perform Step 3. 


Step 3. Let P,, Q,, and R, be three points from the given set that lie on @,, 
with Q; lying between P;, and Rx. Set S41 = Qz, and set p41 = 
PrSy if d(Qx, PeSk) < d(Qx, PrRz); otherwise set 0.4, = RpSp. 
Then increment /: by 1 and repeat Step 2. 


Now Sylvester’s problem is readily solved: The monotonicity of the sequence 
{d;,} guarantees that the algorithm must terminate, so it must produce a line con- 
necting just two points of the given set. An ordinary line must therefore always 
exist. 

We can illustrate Dijkstra’s algorithm with an example. Figure 2.25 shows a 
collection of thirteen points that produce just six ordinary lines (shown in bold), 
along with 21 lines that connect at least three of the points. Figure 2.26 illustrates 
the action of Algorithm 2.25 on these points, using a particular initial configura- 
tion. Each successive diagram shows the line @;, the point S;, off the line, and the 
points P;,, Q,, and Rx on the line. 

Much more is now known about Sylvester’s problem. For example, Csima and 
Sawyer [64,65] proved that every arrangement of n > 3 points in the plane, 
not all on the same line, must produce at least 6n/13 ordinary lines, except for 
certain arrangements of n = 7 points. Figure 2.25 shows that this bound is best 
possible, and Exercise 2 asks you to determine an exceptional configuration for 
n = 7. Also, it has long been conjectured that there are always at least [n/2] 
ordinary lines for a set of n non-colinear points, except forn = 7 andn = 13, but 
this remains unresolved. For additional information on Sylvester’s problem and 
several of its generalizations, see the survey article by Borwein and Moser [34], 
or the book by Brass, Moser, and Pach [37, sec. 7.2]. 
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FIGURE 2.25. A collection of thirteen points with just six ordinary lines. 


Exercises 


il 


Exhibit an arrangement of six points in the plane that produce exactly three 
ordinary lines. 


. Exhibit an arrangement of seven points in the plane that produce exactly 


three ordinary lines. 


. Exhibit an arrangement of eight points in the plane that produce exactly 


four ordinary lines. 


. Exhibit an arrangement of nine points in the plane that produce exactly six 


ordinary lines. 


. Suppose n > 3 points in the plane do not all lie on the same line. Show that 


if one joins each pair of points with a straight line, then one must obtain at 
least n distinct lines. 


. We say a set of points B is separated if there exists a positive number 6 


such that the distance d(P, Q) > 6 for every pair of points P and Q in B. 
Describe an infinite, separated set of points in the plane, not all on the same 
line, for which no ordinary line exists. What happens if you apply Dijkstra’s 
algorithm to this set of points? 


. Repeat problem 6, if each of the points (,y) must in addition satisfy 


ly| <1. 
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FIGURE 2.26. Dijkstra’s algorithm. 
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8. Let the set S consist of the point (0,0), together with all the points in the 
plane of the form (374+, =4;), (se4z, a4), or (0, y45), where k is 
an arbitrary integer. Show that every line connecting two points of S must 


intersect a third point of S. 


9. Consider the following collection T' of three-element subsets of the seven- 
element set S = {a,b,c,d,e, f,g}: 


T = {{a, b, Gh, {a, d, é}, {a, i}; {b, d, Ths {b, e, 9g}, {e, d, gh, 168, Th}. 


(a) Verify that each two-element subset of S is in fact a subset of one of 
the members of 7’, and that any two distinct sets in T’ have at most 
one element in common. 


(b) Explain how this example is germane to Sylvester’s problem. Hint: 
Try thinking of the elements of S as points, and the elements of T as 
lines. 


2.10.2 Convex Polygons 


I would certainly pay $500 for a proof of Szekeres’ conjecture. 
— Paul Erdés, [92, p. 66] 


A set of points S in the plane is said to be convex if for each pair of points a and 
bin S, the line segment joining a to 6 lies entirely in S. Loosely, then, a convex 
set has no “holes” in its interior, and no “dents” in its boundary. Line segments, 
triangles, rectangles, and ellipses are thus all examples of convex sets. 

The convex hull of a finite collection of points T’ in the plane is defined as 
the intersection of all closed convex sets which contain T’. Less formally, if one 
imagines T' represented by a set of pushpins in a bulletin board, then the convex 
hull of T’ is the shape enclosed by a rubber band when it is snapped around all 
the pushpins. The convex hull of a set of three points then is either a triangle or a 
line segment, and for four points we may obtain one of these shapes, or a convex 
quadrilateral. 

In order to avoid degenerate cases, we will assume in this section that our given 
collection of points is in general position, which means that no three points lie on 
the same line, or, using the term from the previous section, that each line connect- 
ing two of the points is ordinary. Thus, the convex hull of a set of four points in 
general position forms either a quadrilateral, or a triangle whose interior contains 
the fourth point of the collection. In the early 1930s, Esther Klein observed that 
one can always find a convex quadrilateral in a collection of five points in general 
position. 


Theorem 2.26. Any collection of five points in the plane in general position con- 
tains a four-element subset whose convex hull is a quadrilateral. 


Proof. Suppose we are given a collection of five points in the plane, with no three 
on the same line. If their convex hull is a pentagon or a quadrilateral, then the 
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statement follows, so suppose that it forms a triangle. Let a and b be the two 
points of the collection lying inside the triangle, and let ¢ be the line connecting a 
and b. Since the points are in general position, two of the vertices of the triangle 
lie on one side of ¢. Label them c and d. Then the convex hull of {a, b,c, d} is a 
quadrilateral. See Figure 2.27. O 


FIGURE 2.27. A convex quadrilateral may always be found among five points in general 
position. 


Klein then asked about a natural generalization. How many points in the plane 
(in general position) are required in order to be certain that some subset forms 
the convex hull of a polygon with n sides? Does such a number exist for each n? 
For example, Figure 2.28 illustrates a collection of eight points, no five of which 
produce a convex pentagon, and a set of sixteen points, no six of which forms 
a convex hexagon. Thus, at least nine points are needed for n = 5, and at least 
seventeen for n = 6. 


FIGURE 2.28. Eight points with no convex pentagon, and sixteen points with no convex 
hexagon. 


Erd6és and Szekeres studied this problem in their first joint paper, in 1935 [94]. 
There they independently developed a version of Ramsey’s theorem, and the proof 
we describe in this section is based on their argument. The statement we develop 
here is much more general than the special case of Ramsey’s theorem that we 
described in Section 1.8, although Ramsey in fact established a still more general 
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result in his seminal paper of 1930 [232] (see Exercise 7). We will also derive 
the bounds on the ordinary Ramsey numbers R(m,7n) stated in Theorems 1.63 
and 1.64 of Section 1.8 as special cases. 

Let ES(n) denote the minimal number of points in the plane in general position 
that are required so that there must exist a subcollection of n points whose convex 
hull is a polygon with n sides (an n-gon). Thus, we have seen that ES(3) = 3, 
ES(4) = 5, and, from Figure 2.28, that ES(5) > 9 and ES(6) > 17. We aim 
to show that ES(n) exists for each n by obtaining an upper bound on its value, 
in terms of n. As a first step, we show that it is enough to find a collection of n 
points, each of whose four-element subsets forms a convex quadrilateral. 


Theorem 2.27. Suppose S is a set of n points in the plane in general position 
with the property that each four-element subset of S is the vertex set of a convex 
quadrilateral. Then S is the set of vertices of a convex n-gon. 


Proof. Let H denote the convex hull of S, and suppose a € S lies in the interior 
of H. Let b € S with a ¥ b. Divide H into triangles by joining b to each vertex 
of H. Then a lies in the interior of one of these triangles, and we label its vertices 
b, c, and d. But then {a, b, c, d} is a four-element subset of S whose convex hull 
is a triangle, contradicting our assumption. Oo 


Next, we develop the more general version of Ramsey’s theorem. Recall that in 
Section 1.8 we defined R(m, n) to be the smallest positive integer NV such that any 
2-coloring of the edges of the complete graph K y (using the colors red and blue) 
must produce either a red K,,, or a blue K,, as a subgraph. Coloring each edge of 
Ky is certainly equivalent to assigning a color to each of the (¥) subsets of size 2 
of the set {1,2,...,.N}, and so we might consider what happens more generally 
when we assign a color to each of the ie ) subsets of size k, for a fixed positive 
integer k. We call such a subset a k-subset of the original set. Ramsey’s theorem 
extends in a natural way to this setting. For convenience, we let |] denote the set 
{1,2,...,.N}, and we define the generalized Ramsey numbers in the following 
way. 


Definition. For positive integers k, m, and n, with m > k andn > k, the Ramsey 
number R;(m, 7) is defined as the smallest positive integer N such that in any 
2-coloring of the k-subsets of [V] (using the colors red and blue) there must exist 
either a subset of m elements, each of whose k-subsets is red, or a subset of n 
elements, each of whose k-subsets is blue. 


Thus, the Ramsey numbers R(m,n) of Section 1.8 are denoted by R2(m,n) 
here. Also, just as the ordinary Ramsey numbers can be described in terms of 
coloring edges of complete graphs, so too can we describe R;,(m,n) in terms of 
coloring edges of certain hypergraphs (see Exercise 1). 

The next theorem establishes that the Ramsey numbers R;,(m, 7) always exist, 
and provides an upper bound on their values. 
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Theorem 2.28 (Ramsey’s Theorem). Let k, m, and n be positive integers, with 
min{m,n} > k. Then the Ramsey number Ry(m,n) exists. Furthermore, for 
each such k, m, and n, we have 


Ri(m,n) =m+n-1, (2.136) 
Ry(k,n) =n, (2.137) 
Ry(m,k) =m, (2.138) 


and, if min{m,n} > k > 2, then 
Ri(m,n) < Ry-1(Re(m — 1,n) + Re(m,n— 1)) +1. (2.139) 


Proof. First, consider the case k = 1. If the elements of [NV] are each colored red 
or blue, and there are fewer than m red elements and fewer than n blue elements, 
then certainly VN < m+n — 2, and (2.136) follows. 

Second, suppose & = m, and suppose that each k-subset of [V] is colored red 
or blue. If any is red then we have a qualifying m-subset, so suppose all are blue. 
Then we have a qualifying n-subset precisely when N > n. Thus, the formula 
(2.137) follows, and by symmetry so does (2.138). 

To establish (2.139), suppose min{m,n} > k > 2. Using induction on k, 
we may assume that R;,—1(a, b) exists for all integers a and b with min{a, b} > 
k; — 1, and further by induction on m + n we may assume that R;,(m — 1,n) 
and Ry(m,n — 1) both exist. Let m’ = Ry(m— 1,n), n’ = Ri, (m,n — 1), and 
N = Rg_-1(m’,n’) + 1, and consider an arbitrary 2-coloring C of the k-subsets 
of [N] using the colors red and blue. Create a coloring C’ of the (& — 1)-subsets 
of [N — 1] by assigning a subset X of size k — 1 the color of the set X U {N} 
in C. Since N — 1 = Ry_1(m’,n’), the coloring C’ must produce either a subset 
of [N — 1] of cardinality m’, each of whose (k — 1)-subsets is red, or a subset of 
[N — 1] of cardinality n’, each of whose (& — 1)-subsets is blue. Suppose the first 
possibility occurs (the argument for the second case is symmetric), and let S be 
a qualifying subset of [N — 1]. Since S has m’ = R;,(m — 1,n) elements, there 
must exist either a subset of size m— 1 of S, each of whose k-subsets is red in the 
original coloring C, or a subset of size n of S, each of whose k-subsets is blue 
in C’. In the latter case, we are done, so suppose the former case occurs, and let 
T be such a subset of [N — 1]. Let T’ = TU {N}, and suppose X is a k-subset 
of T’. If N ¢ X, then X C S,so X isredin C. If N € X, then X \{N} isa 
(k — 1)-subset of S and so is red in C’, and thus X is red in C. O 


Using this result, we can now establish the upper bound for the original Ramsey 
numbers R2(m, 7) that was cited in Section 1.8. 


Corollary 2.29. Suppose m and n are integers with min{m,n} > 2. Then 


R2(m,n) < Ro(m— 1,n) + Ro(m,n-— 1) (2.140) 
and : 
Ro(m,n) < aw ) (2.141) 
m—-1 


274 2. Combinatorics 


Proof. The inequality (2.140) follows at once from (2.136) and (2.139). The for- 
mulas (2.137) and (2.138) produce equality in (2.141) for the cases m = 2 and 
n = 2 respectively, and the general inequality follows by induction on m+ n (see 
Exercise 3). O 


Armed with Ramsey’s theorem, we may now prove that a sufficiently large 
collection of points in the plane in general position must contain a subset that 
forms the vertices of a convex n-gon, for any positive integer n. 


Theorem 2.30. [fn > 3 is an integer, then ES(n) < R4(5, 7). 


Proof. Let S be a collection of N = R4(5,n) points in the plane in general 
position. For each four-element subset T' of S, assign T the color red if its convex 
hull is a triangle, and assign it the color blue if it is a quadrilateral. By Ramsey’s 
Theorem, there must exist either a five-element subset of S whose 4-subsets are 
all red, or an n-element subset of S whose 4-subsets are all blue. The former case 
is impossible by Theorem 2.26, so the latter case must occur, and this implies that 
the n points form the vertex set of a convex n-gon by Theorem 2.27. O 


Much more is known about the quantity ES(m). In the same article [94], Erdés 
and Szekeres employ a separate geometric argument to show that in fact 


ES(n) < nS) 44, 


Since then, this bound has been improved several times. For example, in 2005 
Toth and Valtr [268] proved that 


ES(n) < (>?) fia 


forn > 5. 

Few exact values of ES(n) have been determined. In [94], Erd6és and Szekeres 
noted that Makai first proved that ES(5) = 9, so Figure 2.28 exhibits an extremal 
configuration. Proofs of this statement were published later in [171] and [30]. In 
2006, Szekeres and Peters [266] employed a computational strategy to establish 
that ES(6) = 17. Thus, again Figure 2.28 illustrates an optimal arrangement. 
Erdés and Szekeres conjectured that in fact ES(n) = 2”~? + 1 for all n > 3, and 
this problem remains open. This is the $500 conjecture that Erdés was referring 
to in the quote that opens this section. 

It is known that ES(n) cannot be any smaller than the conjectured value. In 
1961, Erdés and Szekeres [95] described a method for placing 2”~? points in the 
plane in general position so that no convex n-gon appears. Their construction was 
later corrected by Kalbfleisch and Stanton [172]. Thus, certainly 


ES(n) > 27-7 41 


for n > 7. For additional information on this problem and many of its general- 
izations, see for instance the books by Brass, Moser, and Pach [37, sec. 8.2] and 
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MatouSek [200, chap. 3], the survey article by Morris and Soltan [208], or the 
note by Dumitrescu [82]. 


Exercises 


1. 


2 


State Ramsey’s theorem in terms of coloring edges of certain hypergraphs. 


Exhibit a collection of eight points in general position in the plane whose 
convex hull is a triangle, so that no subset of four points forms the vertex 
set of a convex quadrilateral. 


. Complete the proof of Corollary 2.29. 


. (Johnson [169].) If S is a finite set of points in the plane in general position, 


and T is a subset of S of size 3, let ys(T) denote the number of points of 
S that lie in the interior of the triangle determined by 7’. Complete the 
following argument to establish a different upper bound on ES(n). 


(a) Let n > 3 be an integer. Prove that if S is sufficiently large, then there 
exists a subset U of S of size n such that either every 3-subset T of UV 
has ws(T’) even, or every such subset has ¢5(T) odd. 


(b) If U does not form the vertex set of a convex n-gon, then by Theo- 
rem 2.27 there exist four points a, b, c, and d of U, with d lying inside 
the triangle determined by a, b, and c. Show that 


vs({a, b, ch) = vs({a, b, d}) ot ws({b, C, d}) “1 ws({a, C, d}) als 


(c) Establish a contradiction and conclude that ES(n) < R3(n, 7). 


. (Tarsy [188].) If a, b, and c form the vertices of a triangle in the plane, let 


O(a, b,c) = 1 if the path a > b > c — a induces a clockwise orientation 
of the boundary, and let (a,b,c) = —1 if it is counterclockwise. Thus, 
for example, (a, b,c) = —0(a,c, b). Complete the following argument to 
establish an upper bound on ES(n). 


(a) Let n > 3 be an integer, and let S = {v1,v2,...,vn} be a set of 
labeled points in the plane in general position. Prove that if V is suffi- 
ciently large, then there exists a subset U of S of size n such that either 
every 3-subset {v;,v;, ux} of U witha < j < khas 6(u;,v;, ve) = 1, 
or every such subset has 6(v;, vj, vz) = —1. 


(b 


wm 


Prove that if S' contains a 4-subset whose convex hull is a triangle, 
then this subset must contain triangles of both orientations with re- 
spect to the ordering of the vertices. 


(c) Conclude that ES(n) < R3(n,n). 
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6. Complete the proof of Theorem 1.64 by proving that if m and n are positive 


integers with min{m,n} > 2, and Ro(m—1,n) and Ro(m, n—1) are both 
even, then 


R2(m,n) < Ro(m—1,n) + Ro(m,n — 1) - 1. 


Use the following strategy. Let r; = Ro(m —1,n), ro = Ro(m,n— 1), 
and N = r; + rg — 1. Suppose that the edges of Ky are 2-colored, using 
the colors red and blue, in such a way that no red 1, nor blue K,, appears. 


(a) Show that the red degree of any vertex in the graph must be less than 
T 1. 
(b) Show that the red degree of any vertex in the graph must equal 7; — 1. 


(c) Compute the number of red edges in the graph, and establish a con- 
tradiction. 


. Prove the following more general version of Ramsey’s theorem. Let k, 11, 


ng, ..., Ny be positive integers, with min{n1,...,n,-} > k, and let cy, 
C2, ..., Cr denote r different colors. Then there exists a positive integer 
R,(m1,...,M,) such that in any r-coloring of the k-subsets of a set with 
N > Rz(n1,...,,) elements, there must exist a subset of n; elements, 
each of whose k-subsets has color c;, for some 7 with 1 <i <r. 


. (Schur [251].) If C is an r-coloring of the elements of [V], then let C’ be the 


r-coloring of 2-subsets of [N] U {0} obtained by assigning the pair {a, b} 
the color of |b — a] in C. 


(a) Use the generalized Ramsey’s Theorem of Exercise 7 to assert that if 
N is sufficiently large then in [IV] U {0} there must exist a set of three 
nonnegative integers, each of whose 2-subsets has the same color in 
Cc’. 

(b) Conclude that if V is sufficiently large then there exist integers a and 
bin [N], with a + b < N, such that a, b, and a + b all have the same 
color in C. 


9. Let S be a finite set of points in the plane, and let P be a convex polygon 


whose vertices are all selected from 5S. We say P is empty (with respect 
to S') if its interior contains no points of S. Erdés asked if for each integer 
n > 3 there exists a positive integer ESo(n) such that any set of at least 
ESo(n) points in general position in the plane must contain an empty n- 
gon, but this need not be the case for sets with fewer than ESo(n) points. 


(a) Compute ESo(3) and ESo(4). 

(b) (Ehrenfeucht [91].) Prove that ESo(5) exists by completing the fol- 
lowing argument. Let S be a set of ES(6) points in general position 
in the plane, and let P be a convex hexagon whose vertices lie in S, 
selected so that its interior contains a minimal number of points of S. 
Denote this number by m. 
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i. Complete the proof ifm = 0orm = 1. 

ii. If m > 2, let H be the convex hull of the points of S lying inside 
P, and let @ be a line determined by two points on the boundary 
of H. Finish the proof for this case. 


The argument above establishes that ESo9(5) < 17; in 1978 Harborth [151] 
showed that in fact ES9(5) = 10. Horton [164] in 1983 proved the sur- 
prising result that ESo(n) does not exist for n > 7. More recently, Gerken 
[121] and Nicolas [215] solved the problem for n = 6: A sufficiently large 
set of points in the plane in general position must contain an empty convex 
hexagon. The precise value of ESo(6) remains unknown, though it must 
satisfy 30 < ESo(6) < ES(9) < 1717. (An example by Overmars [219] 
establishes the lower bound; additional information on the upper bound can 
be found in [182,271].) 


2.11 References 


You may talk too much on the best of subjects. 
— Benjamin Franklin, Poor Richard’s Almanack 


We list several additional references for the reader who wishes to embark on fur- 
ther study. 


General References 


The text by van Lint and Wilson [273] is a broad and thorough introduction to the 
field of combinatorics, covering many additional topics. Classical introductions to 
combinatorial analysis include Riordan [235] and Ryser [246], and many topics in 
discrete mathematics and enumerative combinatorics are developed extensively in 
Graham, Knuth, and Patashnik [133]. The text by Pélya, Tarjan, and Woods [227] 
is a set of notes from a course in enumerative and constructive combinatorics. A 
problems-oriented introduction to many topics in combinatorics and graph theory 
can be found in Lovasz [191]. The book by Nijenhuis and Wilf [216] describes 
efficient algorithms for solving a number of problems in combinatorics and graph 
theory, and a constructive view of the subject is developed in Stanton and White 
[263]. Texts by Aigner [4,5], Berge [24], Comtet [60], Hall [146], and Stanley 
[261,262] present more advanced treatments of many aspects of combinatorics. 


Combinatorial Identities 


The history of binomial coefficients and Pascal’s triangle is studied in Edwards 
[85], and some interesting patterns in the rows of Pascal’s triangle are observed 
by Granville [138]. Combinatorial identities are studied in Riordan [236], and 
automated techniques for deriving and proving identities involving binomial co- 
efficients and other quantities are developed in PetkovSek, Wilf, and Zeilberger 
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[222]. Combinatorial proofs for many identities are also developed in the book by 
Benjamin and Quinn [22]. 


Pigeonhole Principle 


More nice applications of the pigeonhole principle, together with many other suc- 
cinct proofs in combinatorics and other subjects, are described in Aigner and 
Ziegler [6]. An interesting card trick based in part on a special case of Theo- 
rem 2.4 is described by Mulcahy [210]. Polynomials with {—1, 0, 1} coefficients 
and a root of prescribed order m at x = 1, as in Exercise 14 of Section 2.4, are 
studied by Borwein and Mossinghoff [35]. 


Generating Functions 


More details on generating functions and their applications can be found for in- 
stance in the texts by Wilf [284] and Graham, Knuth, and Patashnik [133], and 
in the survey article by Stanley [260]. The problem of determining the minimal 
degree d;, of a polynomial with {0, 1} coefficients that is divisible by (2 + 1)*, as 
in Exercise 5 of Section 2.6.5, is studied by Borwein and Mossinghoff [36]. Some 
properties of the generalized Fibonacci numbers (Exercise 8b of Section 2.6.5) 
are investigated by Miles [203]. 


Polya’s Theory of Counting 


Pélya’s seminal paper on enumeration in the presence of symmetry is translated 
into English by Read in [226]. Redfield [233] independently devised the notion of 
a cycle index for a group, which he termed the group reduction formula, ten years 
before Pélya’s paper. As a result, many texts call this topic Pélya-Redfield theory. 
This theory, along with the generalization incorporating a color group, is also de- 
scribed in the expository article by de Bruijn [68], and his research article [69]. 
Further generalizations of this theory are explored by de Bruijn in [70], culminat- 
ing in a “monster theorem.” Another view of de Bruijn’s theorem is developed by 
Harary and Palmer in [149; 150, chap. 6]. 

Applications of this theory in chemistry are described in the text by Fujita 
[116], and additional references for enumeration problems in this field are col- 
lected in the survey article [13]. Some applications of Pélya’s and de Bruijn’s 
theorems in computer graphics appear for example in articles by Banks, Linton, 
and Stockmeyer [15, 16]. 


More Numbers 


The book [10] by Andrews and Eriksson is an introduction to the theory of parti- 
tions of integers, directed toward undergraduates. A more advanced treatment is 
developed by Andrews [9]. Euler’s original proof of the pentagonal number the- 
orem, along with some of its additional ramifications, is described by Andrews 
in [8]. 
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The history of Stirling numbers, the notations developed for them, and many in- 
teresting identities they satisfy are discussed by Knuth in [177]. Rhyming schemes, 
as in Exercise 7d of Section 2.8.3 and Exercise 8 of Section 2.8.4, are analyzed 
by Riordan [237]. Stirling set numbers arise in a natural way in an interesting 
problem on juggling in an article by Warrington [280]. Some identities involving 
the complementary Bell numbers (Exercise 6 of Section 2.8.4) are established in 
the article by Uppuluri and Carpenter [270]. 

Eulerian numbers appear in the computation of the volume of certain slabs of n- 
dimensional cubes in articles by Chakerian and Logothetti [51] and Marichal and 
Mossinghoff [197], and in the solution to a problem concerning a novel graduation 
ceremony in an article by Gessel [122]. 

The reference book by Sloane and Plouffe [258] and website by Sloane [257] 
catalog thousands of integer sequences, many of which arise in combinatorics and 
graph theory, and list references to the literature for almost all of these sequences. 
The book by Conway and Guy [61] is an informal discussion of several kinds of 
numbers, including many common combinatorial sequences. 


Stable Marriage 


The important results of Gale and Shapley appeared in [117]. A fast algorithm that 
solves the “stable roommates” problem whenever a solution exists was first de- 
scribed by Irving in [166]. Stable matching problems are studied in Knuth [178] 
as motivation for the mathematical analysis of algorithms, and the structure of 
stable matchings in marriage and roommate problems is described in detail by 
Gusfield and Irving [143], along with algorithms for their computation. A match- 
ing algorithm for the “many-to-many” variation of the stable marriage problem, 
as in Exercise 9 of Section 2.9.2, is developed by Baiou and Balinski [14]. The 
monograph by Feder [103] studies extensions of the stable matching problem to 
more general settings. 


Combinatorial Geometry 


A survey on Sylvester’s problem regarding ordinary lines for collections of points, 
as well as related problems, appears in Borwein and Moser [34]. A variation of 
Sylvester’s theorem for an infinite sequence of points lying within a bounded re- 
gion in the plane is investigated by Borwein [33]. The influential paper of Erdés 
and Szekeres on convex polygons, first published in [94], also appears in the col- 
lection by Gessel and Rota [123]. The survey by Morris and Soltan [208] sum- 
marizes work on this problem and several of its variations. Dozens of problems 
in combinatorial geometry, both solved and unsolved, are described in the books 
by Brass, Moser, and Pach [37], Hadwiger, Debrunner, and Klee [144], Herman, 
Kuéera, and Sima [158], and MatouSek [200], as well as the survey article by 
Erdés and Purdy [93]. 
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Collected Papers 


The collection [123] by Gessel and Rota contains many influential papers in com- 
binatorics and graph theory, including the important articles by Erdés and Szek- 
eres [94], Polya [225], and Ramsey [232]. The two-volume set edited by Graham 
and NeSetril [134, 135] is a collection of articles on the mathematics of Paul Erdés, 
including many contributions regarding his work in combinatorics and graph the- 
ory. The Handbook of Combinatorics [131, 132] provides an overview of dozens 
of different areas of combinatorics and graph theory for mathematicians and com- 
puter scientists. 


3 


Infinite Combinatorics and Graphs 


... the definitive clarification of the nature of the infinite has become 


necessary... 
— David Hilbert [160] 


Infinite sets are very peculiar, and remarkably different from finite sets. This can 
be illustrated with a combinatorial example. 

Suppose we have four pigeons and two pigeonholes. If we place the pigeons 
in the pigeonholes, one of the pigeonholes must contain at least two pigeons. 
This crowding will always occur, regardless of the arrangement we choose for the 
pigeons. Furthermore, the crowding will occur whenever there are more pigeons 
than holes. In general, if P (pigeons) is a finite set, and H (pigeonholes) is a 
proper subset of P, then there is no matching between the elements of P and H. 

Now suppose that we have a pigeon for each real number in the closed interval 
P = (0, 2]. Put a leg tag on each pigeon with its real number. Also suppose that we 
have a pigeonhole for each real number in the interval H = [0, 1]. Put an address 
plate on each pigeonhole with its real number. Note that H C P, so the set of 
address plate numbers is a proper subset of the set of leg tag numbers. For each 
x € [0,2], place the pigeon tagged x in the pigeonhole with address x/2. Using 
this arrangement, no two pigeons will be assigned to the same pigeonhole. Thus, 
if P is infinite and H is a proper subset of P, there may be a matching between 
the elements of P and those of H. 

Infinite sets behave differently from finite sets, and we have used ideas from 
graph theory and combinatorics to illustrate this difference. One of the justifica- 
tions for studying infinite versions of combinatorial and graph-theoretic theorems 
is to gain more insight into the behavior of infinite sets and, by contrast, more 
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FIGURE 3.1. A matching. 
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insight into finite sets. Sections 3.1 and 3.2 follow this agenda, culminating in 
a proof of a finite combinatorial statement using infinite tools. We can also use 
combinatorial properties to distinguish between different sizes of infinite sets, as 
is done in Section 3.7. This requires the deeper understanding of the axioms for 
manipulating infinite sets provided by Sections 3.3 and 3.4, and a precise notion 
of size that appears in Section 3.5. Combinatorial and graph-theoretic properties 
can also illuminate the limitations of our axiom systems, as shown in Sections 3.6 
and 3.9. The chapter concludes with a hint at the wealth of related topics and a 
list of references. 


3.1 Pigeons and Trees 


I wonder about the trees. 
— Robert Frost, The Sound of Trees 


The chapter introduction shows one way to use pigeons to distinguish between 
some finite and infinite sets. We could use this as a basis for defining finite sets, 
but this approach has some drawbacks that we will see in Section 3.4. It is more 
straightforward to say that a set is infinite if its not finite, and that a set is finite if 
its elements can be matched with a bounded initial segment of N. For example, 
the set {A, B, C, D} is finite, because the matching in Figure 3.1 exists. Note that 
the least integer not used in this matching is 4, which is also the size of the set 
{A, B,C, D}. This nifty trick, the result of using 0 in our matchings, reappears 
in Section 3.5. 

Using the preceding notion of infinite and finite sets, we can propose another 
pigeonhole principle. Suppose we have an infinite number of pigeons that we stuff 
into a finite number of pigeonholes. Momentarily disregarding physical consider- 
ations, we must have at least one pigeonhole that contains an infinite number of 
pigeons. Letting P be the set of pigeons, H the set of holes, and f the stuffing 
function, we obtain the following theorem. 


Theorem 3.1 (Infinite Pigeonhole Principle). Suppose P is infinite, H is finite, 
and f : P — H. Then there is an element h € H such that the pre-image set 
{p € P| f(p) = h} is infinite. 


Proof. Let P, H, and f be as in the hypothesis of the theorem. In particular, let 
H = {ho,hi,..., hn}. Suppose, by way of contradiction, that for each h; € H, 
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FIGURE 3.2. A tree with labels. 


the set P; = {p € P| f(p) = h;} has s; elements. Because P can be written as 
P= PUP, U---UP,, we see that Dex s; 1s the size of P. Thus P is finite, 
providing the desired contradiction. - oO 


A physicist might suggest that the density of matter resulting from cramming 
an unbounded number of pigeons into a bounded pigeonhole would yield a fusion 
explosion, obliterating any evidence that could be used by litigious animal rights 
advocates. Home experiments with actual live pigeons are strongly discouraged. 
Despite the physical impracticality of our theorem, it is handy for proving a very 
nice theorem about trees. 

As stated in Chapter 1, a tree is a connected acyclic graph. For big trees, it is 
handy to designate a root and label the vertices. Figure 3.2 is an example. As a 
convenient convention, we always think of the root r as the bottom of the tree 
and vertices farther from r as being higher in the tree. A path through a tree is 
a path leading up and away from the root. For example, r, a1, b2 and r, a1, b1, co 
are paths in the tree above. The sequence r, a, bg is not a path, because a bo is 
not an edge in the graph. If we add the edge a bo, the resulting graph is not a tree. 
(Find the cycle!) 

A level in a tree is the collection of all vertices at a fixed distance from the root. 
The levels in our sample tree are {r}, {ao, a1}, {bo, bi, b2} and {co, c1, co}. If v 
is a vertex and w is a neighboring vertex in the next higher level, then we call w 
an immediate successor of v. In the sample, b; is an immediate successor of a1, 
and bo is not. We can even say that c; is a successor of a1, but not an immediate 
successor. 

The vertex labels in the sample tree are arbitrary; if we want more than 26 
levels, we could use a different labeling scheme. It is even possible to reuse labels 
in some circumstances, as shown in Exercise 2. 

Now we are ready to state K6nig’s Lemma. The result concerns infinite trees, 
that is, trees with an infinite number of vertices. Essentially, Konig’s Lemma says 
that big skinny trees are tall. 
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Theorem 3.2 (K6nig’s Lemma). /f T is an infinite tree and each level of T is 
finite, then T contains an infinite path. 


Proof. Let T be an infinite tree in which every level is finite. Let Lo = {r}, Li, 
LI, ...be the levels of T°. We will construct a path as follows. Let r be the first 
element of the path. There are infinitely many vertices in T’ above r. Each of these 
vertices is either in L, or above a unique vertex in L;. Map each of the vertices 
above r to the vertex of L; that it is equal to or above. We have mapped infinitely 
many vertices (pigeons) to the finitely many vertices of L, (pigeonholes). By 
Theorem 3.1, there is at least one vertex of L, that is above r and has infinitely 
many vertices above it; pick one and call it v;. The path so far is r, v;. Since there 
are infinitely many vertices above v1, we can replace r by v1 in the preceding 
argument and select v2. Similarly, for each n € N, when we have found v,, we 
can find v,41. Thus T contains an infinite path. Oo 


K6nig’s Lemma appears in the 1927 paper of Dénes K6nig [179]. Some authors 
(e.g [218]) refer to the lemma as KGnig’s Infinity Theorem. The name K6nig’s 
Theorem is usually reserved for an unrelated result on cardinal numbers proved 
by Julius K6nig, another (earlier) famous Hungarian mathematician. 


Exercises 


1. Suppose we arrange finitely many pigeons in infinitely many pigeon holes. 
Use the Infinite Pigeonhole Principle to prove that there are infinitely many 
pigeonholes that contain no pigeons. 


2. Reusing labels in trees. 
Figure 3.3 shows an example of a tree where labels are reused. 


FIGURE 3.3. A tree with reused labels. 


Note that in this tree, each vertex can be reached by a path corresponding 
to a unique sequence of labels. For example, there is exactly one vertex 
corresponding to r, 0, 1. 


3. 


3.2 
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(a) Give an example of a tree with badly assigned labels, resulting in two 
vertices that have the same sequence of labels. 


(b) Prove that if the immediate successors of each vertex in a tree have 
distinct labels, then no two vertices can have matching sequences of 
labels. 


(c) Prove the converse of part 2b. 


2-coloring an infinite graph. 

Suppose G is a graph with vertices V = {v; | i © N} and every finite 
subgraph of G can be 2-colored. Use K6nig’s Lemma to prove that G is 
2-colorable. (Hint: Build a tree of partial colorings. Put the vertex root, 
red, blue, blue in the tree if and only if assigning red to vo, blue to v;, and 
blue to v2 yields a 2-coloring of the subgraph with vertices {vo, v1, vo}. An 
infinite path through such a tree will be a coloring of G. You must prove 
that the tree is infinite and that each level is finite.) 


. Construct an infinite graph where each finite subgraph can be colored using 


a finite number of colors, but where infinitely many colors are needed to 
color the entire graph. (Hint: Use lots of edges.) 


. Heine—Borel Theorem on compactness of the real interval [0, 1]. 


Use Kénig’s Lemma to prove that if (ao, bo), (a1, b1),... are open inter- 
vals in R and [0,1] C (ao, 60) U (a1, 61) U---, then for some finite value 
n, [0,1] © (ao, bo) U (a1, 61) U-++ U (Gn, bn). (Hint: Build a tree where 
the labels in the 2th level are the closed intervals obtained by removing 
(ao, 60) U (a1, b1) U+ ++ U (aj, b;) from [0, 1] and the successors of a vertex 
v are labeled with subintervals of the interval for v. Use the fact that the in- 
tersection of any sequence of nested closed intervals is nonempty to show 
that the tree contains no infinite paths. Apply the contrapositive of K6nig’s 
Lemma.) 


Ramsey Revisited 


Ah! the singing, fatal arrow, 
Like a wasp it buzzed and stung him! 
— H. W. Longfellow, The Song of Hiawatha 


Suppose that we 2-color the edges of Ag, the complete graph with six vertices, 
using the colors red and blue. As we proved in Chapter 1, the colored graph must 
contain a red K3 or a blue K3. Since we can 2-color Ks in a way that prevents 
monochromatic triangles, /’g is the smallest graph that must contain a monochro- 
matic triangle. Thus, the Ramsey number /2(3, 3) is 6, as noted in Theorem 1.61. 
If we want to guarantee a monochromatic (‘4 subgraph then we must 2-color Kj, 
because R(4,4) = 18. Exact values for R(p, p) when p > 5 are not known, but by 
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the Erd6s—Szekeres bound (Theorem 1.63) we know that these Ramsey numbers 
exist. 

Suppose that G' is the complete graph with vertices V = {v; | i © N}. If we 2- 
color the edges of G, what can we say about monochromatic complete subgraphs? 
Since G contains Kg, it must contain a monochromatic K’3. Similarly, since G 
contains Kg, it must contain a monochromatic Ky. For p > 5, we know that 
R(p, p) is finite and that G contains K’p(,,,) as a subgraph, so G must contain a 
monochromatic K’,. So far we know that G must contain arbitrarily large finite 
monochromatic complete subgraphs. As a matter of fact, G contains an infinite 
complete monochromatic subgraph, though this requires some proof. 


Theorem 3.3. Let G be a complete infinite graph with vertices V = {v; |i € N}. 
Given any 2-coloring of the edges, G will contain an infinite complete monochro- 
matic subgraph. 


Proof. Suppose the edges of G are colored using red and blue. We will build an 
infinite subsequence (w; | 7 € N) of V by repeatedly applying the pigeonhole 
principle (Theorem 3.1). Let wo = vo. For each i > 0, the edge vov; is either 
red or blue. Since this assigns v; to one of two colors for each 27 > 0, there is 
an infinite set of vertices Vo such that all the edges {vpv | v € Vo} are the same 
color. Suppose we have selected w,, and V,,. Let w,41 be the lowest-numbered 
vertex in V,,, and let V,,,; be an infinite subset of V,, such that the edges in the 
set {wr4iv | v © Vn4i} are the same color. This completes the construction of 
the sequence. 

This sequence (w; | 7 © N) has a very interesting property. If i < j < k, 
then w; and wz, are both in V;, and consequently w;w; and w;w, are the same 
color! We will say that a vertex w; is blue-based if 7 > 7 implies w;w; is blue, 
and red-based if 7 > 7 implies w;w, is red. Each vertex in the infinite sequence 
(w; | i € N) is blue-based or red-based, so by the pigeonhole principle there 
must be an infinite subsequence (w;,, w;,,...) where each element has the same 
color base. As a sample case, suppose the vertices in the subsequence are all blue- 
based. Then for each j < k, since w;, is blue-based, the edge wj, w;,, is blue. Thus 
all the edges of the complete subgraph with vertices {w;,,w;,,...} are blue. If 
the subsequence vertices are red-based, then the edges of the associated infinite 
complete subgraph are red. O 


Using the preceding theorem, we can prove that the finite Ramsey numbers 
exist without relying on the Erdés—Szekeres bound. 


Theorem 3.4. For each n € N there is anm € N such that R(n,n) = m. 


Proof. By way of contradiction, suppose that there is an n such that for every 
m there is a 2-coloring of the edges of K,, that contains no monochromatic K,, 
subgraph. Let G' be the complete graph with vertices V = {v; | i € N}. Suppose 
E = {e;|7 € N} is an enumeration of the edges of G. Construct a tree T of partial 
edge colorings of G' as follows. Include the sequence root, co, €1, €2,.--,ck inT 
if and only if whenever edge e; is colored color c; for all 7 < k, the subgraph of 
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G containing €9, €1,...,€% contains no monochromatic K,,. The kth level of T 
contains at most 2” vertices, so each level is finite. Since we have assumed that 
there is a way of coloring any K,,, so that no monochromatic K,, appears, T is 
infinite. By K6nig’s Lemma (Theorem 3.2), T’ has an infinite path. This infinite 
path provides a 2-coloring of G that contains no monochromatic K,,. Thus for 
this coloring, G has no infinite complete monochromatic subgraph, contradicting 
the preceding theorem. Our initial supposition must be false, and so for each n, 
there is an m such that R(n,n) =m. oO 


We just used the infinite pigeonhole principle, infinite trees, and colorings of 
infinite graphs to prove a result about finite graphs! (In doing so, we are imitating 
Ramsey [232].) Besides being inherently fascinating, infinite constructions are 
very handy. Furthermore, the arguments are easily generalized. In order to take 
full advantage of our work, we need some new notation. 

Here come the arrows! The notation « — (A)? means that every c-colored 
complete graph on « vertices contains a monochromatic complete subgraph with 
A vertices. Most people pronounce « — (A)? as “kappa arrows lambda 2 c.” The 
statement that R(3,3) = 6 combines the facts that 6 > (3)3 (KX is big enough) 
and 5 4 (3)3 (Ks is not big enough). If we imitate set theorists and write w for 
the size of the set V = {v; | i € N}, we can rewrite Theorem 3.3 as w > w3. 
Abbreviating “for all n” by Vn and “there exists an m” by 4m, Theorem 3.4 
becomes Ynim m — (n)3. 

Arrow notation is particularly useful if we want to use lots of colors. It is easy 
to check that if every use of two colors is replaced by some finite value c in the 
proof of Theorem 3.3, the result still holds. The same can be said for Theorem 


3.4. Consequently, for any c € N we have 


2 
a 


w—(w)? and Wnimm — (n) 


Note that when c is largish, the arrow notation is particularly convenient. For 
example, the statement “m is the least number such that m — (3)2” translates into 
Ramsey number notation as the unwieldy formula R(3, 3, 3,3, 3,3,3,3,3) =m. 
Nobody would want to translate m — (3)?p99. On the other hand, R(3,4) = 9 
does not translate into our arrow notation. 

The 2 in « — (A)? indicates that we are coloring unordered pairs of elements 
taken from a set of size «. When we edge color a graph, we are indeed assigning 
colors to the pairs of vertices corresponding to the edges. However, we can extend 
Ramsey’s theorem by coloring larger subsets. The resulting statements are still 
very combinatorial in flavor, though they no longer refer to edge colorings. For 
example, the notation & — (A)” means that for any assignment of c colors to the 
unordered n-tuples of «, there is a particular color (say lime) and a subset X C k 
of size such that no matter how we select n elements from X, the corresponding 
n-tuple will be lime colored. The proofs of Theorems 3.3 and 3.4 can be modified 
to prove the following theorems. 


Theorem 3.5 (Infinite Ramsey’s Theorem). For alln € Nandc € N,w => (w)”. 


Cc 
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Proof. By induction on n. Exercise 2 gives hints. O 


Theorem 3.6 (Finite Ramsey’s Theorem). For all k,n,c € N, there isanm €N 
such thatm — (k)”. 


c 


Proof. Follows from Theorem 3.5. Exercise 3 gives hints. O 


Throughout this section we have been very picky about our infinite sets. For 
example, V = {v; | 7 € N} has a built-in matching with N. What happens if we 
look at graphs with a vertex for each real number? In Section 3.7 we will learn 
that the analog of Theorem 3.3 fails for an infinite graph of this sort. For what 
sorts of infinite graphs does Theorem 3.3 hold? To answer this question, we need 
a deeper understanding of the infinite. 


Exercises 


1. Let X = {x; | 7 © N} be a set. Suppose that the relation < is a partial 
ordering on X. That is, for any a, b,c € X, suppose that 


eaca, 
e ifa< bandb <a, thena = J, and 


eifa<b<c,thena<c. 


Use Theorem 3.3 to prove that there is an infinite subset Y C X such that 
either 


e for every a,b € Y, eithera < borb<a,or 


e for every a,b € Y, botha £ bandb <a. 


A subset of the first type is called a chain, and a subset of the second type 
is called an antichain. 


2. Prove Theorem 3.5. Begin by proving Theorem 3.5 for 2 colors. Proceed by 
induction on n. For n = 1, use the pigeonhole principle as a base case. For 
the induction step, assume that w — (w)#, and prove that w > (w)3}*? by 
imitating the proof of Theorem 3.3, substituting applications of w > (w)} 
for the use of the pigeonhole principle. 


Given the theorem for 2 colors, there are many ways to prove it for other 
finite numbers of colors. You could replace 2 by c everywhere in the proof 
you just did, or you could try proving the theorem for c colors and n-tuples 
by using the theorem for 2 colors and 2n-tuples. 


3. Prove Theorem 3.6. Imitate the proof of Theorem 3.4, using Theorem 3.5 
in place of Theorem 3.3. 
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Red 


Blue 


FIGURE 3.4. A tripartite graph representing a 2-coloring. 


4. One way to visualize coloring triples. 
We can represent a coloring of triples by attaching a claw to a triple that 
points in a particular direction. For example, the tripartite graph in Figure 
3.4 represents coloring {0, 1,2} red and {1,3, 4} blue. 


Figure 3.5 represents a 2-coloring of the ten triples that can be formed from 
the set V = {0,1,2,3,4}. You can check that every four-element subset of 
V contains a triple with a claw on the blue side and a triple with a claw on 
the red side. Thus, Figure 3.5 illustrates that 5 4 (4)3. 


FIGURE 3.5. A 2-coloring of the triples from {0, 1, 2, 3, 4}. 


(a) Find a different coloring that shows that 5 4 (4)3 and represent it as a 
tripartite graph. (How do you know that your coloring is significantly 
different?) 


(b) Find a tripartite graph that shows that 5 4 (3)3. 
(c) Devise a way to draw a similar graph that shows that 6 4 (3)3. 


(d) Find a tripartite graph that shows that 6 4 (4)3. Since every triple 
gets a claw, make your life easier by drawing only the red claws. 
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3.3. ZFC 


No one shall be able to drive us from the paradise that Cantor 
created for us. 
— David Hilbert [160] 


Paraphrasing Hilbert, in Cantor’s paradise mathematicians can joyfully prove new 
and rich results by employing infinite sets. Since we have been living reasonably 
comfortably in this paradise since the beginning of the chapter, Hilbert’s anxiety 
about eviction may seem misplaced. However, Russell and other mathematicians 
discovered some set-theoretic paradoxes that made the naive use of infinite sets 
very questionable. Hilbert responded by calling for a careful investigation with 
the goal of completely clarifying the nature of the infinite. 

One could argue that Hilbert’s call (made in 1925) had already been answered 
by Zermelo in 1908. In the introduction to [293], Zermelo claimed to have reduced 
the entire theory created by Cantor and Dedekind to seven axioms and a few 
definitions. Although we now use formulations of the axioms of separation and 
replacement that more closely resemble those of Fraenkel and Skolem, the most 
commonly used axiomatization of set theory, ZFC, consists primarily of axioms 
proposed by Zermelo. The letters ZFC stand for Zermelo, Fraenkel, and Axiom 
of Choice. Although Skolem does not get a letter, it would be hard to overestimate 
his influence in recasting ZFC as a first order theory. 

ZFC succinctly axiomatizes what has become the de facto foundation for stan- 
dard mathematical practice. With sufficient diligence, it would be possible to for- 
malize every theorem appearing so far in this book and prove each of them from 
the axioms of ZFC. Since these proofs can be carried out in a less formal setting, 
foundational concerns are insufficient motivation for adopting an axiomatic ap- 
proach. However, many of the results in Sections 3.4 through 3.10 cannot even be 
stated without referring to ZFC. We will use ZFC as a base theory to explore the 
relative strength of some very interesting statements about sets. In particular, ZFC 
will be central to our discussion of large cardinals and infinite combinatorics. 


3.3.1 Language and Logical Axioms 


The comfort of the typesetter is certainly not the summum bonum. 
— Gottlob Frege [112] 


Before we discuss the axioms of ZFC, we need to list the symbols we will use. 
Although some of these symbols may be unfamiliar, they can be used as a very 
convenient shorthand. 

Variables can be uppercase or lowercase letters with subscripts tacked on if 
we please. Good examples of variables include A, B, x, and y3. The symbol 0 
denotes the empty set, and P and U are function symbols for the power set and 
union. The exact meaning of 0, P(), and Ux are determined by the axioms in 
the next section. (Uz is not a typographical error; a discussion appears later.) A 
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Formula | Translation 
ale] not 0 

OAw @ and w 
OV w 0 or w 


dw if 0 then w 

dow 0 if and only if 

Vx for all sets x, 6 holds 

dé there is a set x such that @ holds 


TABLE 3.1. Translations of connectives and quantifiers. 


term is a variable, the symbol 9, or the result of applying a function to a term. 
In ZFC, terms always denote sets. Consequently, all the objects discussed in ZFC 
are sets. Some early formalizations of set theory include distinct objects with no 
elements. These objects are usually called atoms or urelements. They do not show 
up in ZFC. 

The atomic formulas of ZFC are x € y and x = y, where x and y could be any 
terms. As one would expect, the formula x € y means z is an element of y. The 
connection between € and = is partly determined by the axiom of extensionality 
(in the next section) and partly determined by the fact that = really does denote 
the familiar equality relation. 

All other formulas of ZFC are built up by repeatedly applying logical connec- 
tives and quantifiers to the atomic formulas. Table 3.1 lists typical formulas and 
their translations. The letters 9 and w denote formulas of ZFC. 

Specifying that ZFC is a first order theory implicitly appends the axioms for 
predicate calculus with equality to the axioms for ZFC. In a nutshell, these logical 
axioms tell us that the connectives and quantifiers have the meanings shown in 
Table 3.1, and that = is well behaved. In particular, we can substitute equal terms. 
Thus, if 2 = y and @(x) both hold, then 0(y) holds, too. As a consequence, we 
can prove the following theorem. 


Theorem 3.7. Equal sets have the same elements. Formally, 


r=yoVi(terotey). 


Proof. Suppose x = y. Fix t. If t € a, then by substitution, ¢ € y. Similarly, if 
t € y, then ¢t € x. Our choice of t was arbitrary, so Vt(t EC x ot Ey). O 


We could completely formalize the preceding argument as a symbolic logic 
proof in any axiom system for predicate calculus with equality. Some good formal 
axiom systems can be found in Mendelson [201] or Kleene [176] by readers with 
a frighteningly technical bent. 

It is very convenient to write x C y for the formula Vi(t € x > t € y). Using 
this abbreviation and only the axioms of predicate calculus, we could prove that 
Va(a C x), showing that every set is a subset of itself. We could also prove that 
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containment is a transitive relation, which can be formalized as 
VaVyV2((x CyAyCz) 7 2@C 2). 


The preceding results (which appear in the exercises) rely on logical axioms rather 
than on the actual nature of sets. To prove meaty theorems, we need more axioms. 


3.3.2 Proper Axioms 


... 1 tasted the pleasures of Paradise, which produced these Hell 
torments... 
— Pangloss, in Candide 


The axiom system ZFC consists of nine basic axioms plus the axiom of choice. 
Typically, the nine basic axioms are referred to as ZF. In this section, we will 
examine the axioms of ZF, including their formalizations, some immediate appli- 
cations, and a few random historical comments. This should be less painful than 
the affliction of Pangloss. 


1. Axiom of extensionality: If a and b have the same elements, then a = b. 
Formally, 
Va(r Eavxeb)oa=b. 


This axiom is the converse of Theorem 3.7, so ZF can prove that a = 0 if 
and only if a and b have exactly the same elements. Using this, we can prove the 
following familiar theorem about the connection between subsets and equality. 


Theorem 3.8. For all sets a and b, a = bif and only ifa C band b C a. Formally, 
VaVb(a = bo (ac bAbCa)). 


Proof. First suppose that a = b. Since a C a (see Exercise 1), by substitution we 
have a C bandb C a. Thus,a=b— (ac bAbCa). 

To prove the converse, suppose a C b and b C a. Since a C 6, for every 
x we have that if x € a then x © 6b. Similarly, since b C a, x © b implies 
x € a. Summarizing, for all z, x € a a © b. By the axiom of extensionality, 
a=b. O 


The axiom of extensionality and the preceding theorem give us strategies for 
proving that sets are equal. Most proofs of set equality apply one of these two 
approaches. 


2. Empty set axiom: @ has no elements. Formally, Vz(a« ¢ 0). 


The empty set has some unusual containment properties. For example, it is a 
subset of every set. 


Theorem 3.9. () is a subset of every set. Formally, Vt(0 C t). 
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Proof. The proof relies on the mathematical meaning of implication. Suppose t 
is a set. Pick any set x. By the empty set axiom, x ¢ 9, so x € @ implies zx € t. 
(When the hypothesis is false, the implication is automatically true. If I am the 
king of the world, then you will send me all your money. The statement is true, 
but no checks have arrived.) Formally, Va(x € 0 — x € t),so@ Ct. Oo 


The preceding proof also implies that @ C Q, although Exercise 1 provides a 
more direct proof. 


3. Pairing axiom: For every « and y, the pair set {x, y} exists. Formally, 


VaVyszVi(t Eze (t=aVt=y)). 


In the formal version of the axiom, the set z has x and y as its only elements. 
Thus, z is {x,y}. The pair sets provided by the pairing axiom are unordered, so 
{x,y} = {y, x}. The pairing axiom can be used to prove the existence of single- 
element sets, which are often called singletons. 


Theorem 3.10. For every x, the set {x} exists. That is, VxdzVt(t € zo t= 2). 


Proof. Fix x. Substituting x for y in the pairing axiom yields a set z such that 
Vti(t Ez (=a Vt =2)). By the axiom of extensionality, z = {x}. O 


The empty set axiom, the pairing axiom, and Theorem 3.10 on the existence of 
singletons are all combined in Zermelo’s original axiom of elementary sets [293]. 
As an immediate consequence he solves Exercise 4, showing that singleton sets 
have no proper subsets. 

The statement of the next axiom uses the union symbol in an unusual way. In 
particular, we will write U{2, y} to denote the familiar « U y. This prefix notation 
is very convenient for writing unions of infinite collections of sets. For example, 
if X = {x; | i € N}, then the infinite union xp U x1 U v2 U--+ can be written as 
UX, eliminating the use of pesky dots. The union axiom says that UX contains 
the appropriate elements. 


4. Union axiom: The elements of UX are precisely those sets that are elements 
of the elements of X. Formally, 


Vi(t C UX  Jy(t ey Ay EX)). 


Exercise 5 is a verification that U{a, y} is exactly the familiar set « U y. The 
notion of union extends naturally to collections of fewer than two sets, also. By 
the union axiom, ¢ € U{x} if and only if there is a y € {a} such that t € y, 
that is, if and only if ¢ € x. Thus, U{x} = a. For an exercise in wildly vacuous 
reasoning, try out Exercise 6, showing that Uf = 0). 

Like the union axiom, the power set axiom defines one of our built-in functions. 
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5. Power set axiom: The elements of P(X) are precisely the subsets of X. 
Formally, 
Vt(te P(X) Otc X). 


This is the same power set operator that appears in the first chapter of dozens 
of mathematics texts. For example, 


P({a, b}) = {0, {a}, {b}, {a, b}}. 


If X is a finite set of size n, then P(X) has 2” elements. Thus for finite sets, the 
size of P(X) is always larger than the size of X. In Section 3.5 we will prove that 
this relation continues to hold when _X is infinite. 

It may seem odd that we do not have other built-in functions, like intersection, 
set-theoretic difference, or Cartesian products. However, all these operations can 
be defined using the next axiom and are omitted in order to reduce redundancy in 
the axioms. Our version of the separation axiom is the invention of Skolem [255]. 
Both Skolem and Fraenkel [109] proposed emendations to Zermelo’s version of 
the separation axiom. 


6. Separation axiom: If ~(a) is a formula and X is a set, then the set denoted 
by {a € X | (x)} exists. More formally, given any set X and any formula 
~(a) in the language of ZFC, if (a) does not contain the variable S, then 


ASVa(x Ee So (xe X Ay(a))). 


Note that v(a) may contain unquantified variables that can be viewed as 
parameters. Thus S can be defined in terms of X and other given sets. 


We can use the separation axiom to prove that intersections exist. It is nice to 
use intersection notation that is parallel to our union notation, so we write N{a, b} 
for a M b. In general, an element should be in MX precisely when it is in every 
element of X. 


Theorem 3.11. For any nonempty set X, MX exists. That is, for any set X there 
isa set Y such that 


Va(r EY oH Vite X > 2 e€t)). 


Proof. Fix X. Let Y = {a € UX | Vt(t © X — « € t)}. By the separation 
axiom, Y exists. We still need to show that Y is the desired set. By the definition 
of Y, if x € Y, then Vt(t € X — «x € t). Conversely, if Vi(t € X — x € t), 
then since X is nonempty, St(t € X A a € t). Thus « € UX. Because z € UX 
and Vt(t € X — x € t), we have x € Y. Summarizing, x € Y if and only if 
Vite X eed), oO 


It is also possible to show that MX is unique. (See Exercise 8.) Since we can 
show that for all X the set MX exists and is unique, we can add the function 
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symbol / to the language of ZFC. Of course, the symbol itself could be subject 
to misinterpretation, so we need to add a defining axiom. The formula 


Va(a enx @ Vti(te X > 2 €t)) 


will do nicely. The resulting extended theory is more convenient to use, but proves 
exactly the same theorems, except for theorems actually containing the symbol MN. 
Mathematical logicians would say that ZFC with M is a conservative extension of 
ZFC. 

Using the same process, we can introduce other set-theoretic functions. For 
example, we can specify a set that represents the ordered pair (2, y), and define 
the Cartesian product X x Y = {(#,y)|a eX Ay € Y}. Ordered n-tuples can 
be defined in a number of reasonable ways from ordered pairs. We could define 
the relative complement of Y in X by X —Y = {a € X | ax ¢ Y}. See Exercises 
9, 10, and 11 for more discussion of these operations. 

There are some significant restrictions in the sorts of functions that could be 
conservatively added to ZFC. For example, as above it is acceptable to introduce 
the relative complement, but not a full-blown general complement. (Books usually 
use X or X° to denote a general complement.) Given a general complement, we 
could prove that X U X° existed. This would give us a set of all sets, but that is 
prohibited by the separation axiom. 


Theorem 3.12. There is no universal set. That is, there is no set U such that 
Va(a € U). 


Proof. Suppose by way of contradiction that Vx(a € U). Applying the separation 
axiom, there is a set X such that X = {z © U | z € z}. Note that for all z, z © X 
if and only if z € U and z ¢ z. Furthermore, z € U and z ¢ z if and only if 
z ¢ z. Thus, z € X if and only if z ¢ z for any z we care to choose. In particular, 
substituting X for z gives us X € X if and only if X ¢ X, yielding the desired 
contradiction. O 


The preceding proof contains the gist of Russell’s paradox. Briefly, Russell’s 
paradox says that the existence of {z | z ¢ z} leads inexorably to contradictions. 
Note that the existence of {z | z ¢ z} is not proved by the separation axiom, 
because the specified set is not bounded. For any bound X, we can prove that 
{z © X | z ¢ z} exists; it is just a harmless subset of X. By requiring bounds on 
definable sets, we cleverly sidestep paradoxes that ensnare the users of naive set 
theory. For another experiment with Russell’s style of argument, try Exercise 12. 

Part of Hilbert’s motivation for the rigorous study of set theory was to gain 
a deeper understanding of infinite sets. So far, our axioms do not guarantee the 
existence of a single infinite set. (Readers who love technical details may want to 
construct a model of axioms | through 6 in which every set is finite. The universe 
for this model will be infinite, but each element in the universe will be finite.) 

One way to construct an infinite set is to start with @ and successively apply 
Theorem 3.10. If we let ro = 0 and x, 41 = {x,} for each n, this yields a set for 
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each natural number. In particular, 79 = 0, x1 = {0}, x2 = {{0}}, and so on. 
The next axiom affirms the existence of a set containing all these sets as elements. 


7. Infinity axiom: There is a set Z such that (i) @ € Z and (ii) if x € Z, then 
{x} € Z. Formally, 


AZMWEZAVa(az EZ > Ay(ye ZAVUt Ey ot=2z)))). 


The axiom of infinity guarantees the existence of some set satisfying properties 
(i) and (11). By applying the power set axiom, the separation axiom, and taking 
an intersection, we can find the smallest set with this property. For details, see 
Exercise 13. 

Zermelo’s axiomatization of set theory consists of axioms | through 7 plus 
the axiom of choice. We will discuss the axiom of choice shortly. In the mean- 
time, there are two more axioms that have been appended to ZF that should be 
mentioned. The first of these is the axiom of replacement, proposed in various 
versions by Fraenkel ({107], [108], and [110]), Skolem [255], and Lennes [186]. 


8. Replacement axiom: Ranges of functions restricted to sets exist. That is, if 
f(a) is a function and D is a set, then the set R = {f(x) | x € D} exists. 
More formally, if w(a, y) is a formula of set theory such that 


VaWyW2((W(x,y) A (a, 2)) > y = 2), 


then for every set D there is a set R such that 


Vy(y € Ro Aa(x# € DA W(a,y)). 


Note that the formula ~(z,y) in the formal statement of the axiom can be 
viewed as defining the relation f(x) = y. The replacement axiom is useful for 
proving the existence of large sets. In particular, if we assume that ZFC and the 
continuum hypothesis are consistent, in the absence of the replacement axiom it 
is impossible to prove that any sets of size greater than or equal to X,, exist. (To 
find out what X., is, you have to stick around until Section 3.5.) 

The final axiom of ZF is the regularity axiom. In a nutshell, it outlaws some 
rather bizarre behavior, for example having x € y € x. Attempts to avoid these 
strange constructs can be found in the work of Mirimanoff [207], but Skolem 
[255] and von Neumann [276] are usually given credit for proposing the actual 
axiom. 


9. Regularity axiom: Every nonempty set x contains an element y such that 
xy = Q. Formally, 


Va(x £0 Ay(yerArny=9)). 
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The idea here is that € can be viewed as a partial ordering on any set by letting 
x <ymeanz € y. The regularity axiom says that every set has a minimal element 
in this ordering. This rules out loops (like x € y € «) and infinite descending 
chains (like --- € 3 € rq € 2 € 4%). The following theorem shows that tight 
loops are outlawed. 


Theorem 3.13. For all x, x ¢ x. 


Proof. By way of contradiction, suppose x € x. By Theorem 3.10, we know the 
set X = {x} exists. The set X is nonempty, so by the regularity axiom, there is 
an element y € X such that X Ny = Q. The only element of X is x, so y = x and 
X Max =. However, x € X andx € xz,sox € XN x =O, acontradiction. O 


Summarizing this section, the proper axioms of ZF are: 
1. Axiom of extensionality, 

2. Empty set axiom, 

Pairing axiom, 

Union axiom, 

Power set axiom, 


Separation axiom, 


og ae 


Infinity axiom, 
8. Replacement axiom, and 
9. Regularity axiom. 


We are still missing one axiom from Zermelo’s list, the axiom of choice. 


3.3.3 Axiom of Choice 


Vizzini: ... so I can clearly not choose the wine in front of me. 
Man in black: You’ve made your decision then? 
Vizzini: [happily] Not remotely! 
— The Princess Bride 


Suppose that we, like Vizzini, are faced with the task of selecting one glass from a 
set of two glasses. Since the set of glasses is nonempty, we can select one element 
and get on with our lives, which hopefully will be much longer than Vizzini’s. 
To be very technical, the justification for our selection is the logical principle 
of existential instantiation. Similarly, using only axioms of ZF, we can always 
select one element from any nonempty set, without regard for the size of the set. 
Furthermore, we could repeat this process any finite number of times, so we can 
choose one element from each set in any finite list of nonempty sets. 
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By contrast, making an infinite number of choices simultaneously can often be 
problematic, depending on the circumstances. Suppose that we have an infinite 
collection of pairs of boots. We can pick one boot from each pair by specifying 
that we will select the left boot from each pair. Because each nonempty set (pair 
of boots) has a designated element (left boot), ZF suffices to prove the existence 
of the set of selected boots. Working in ZF, we cannot carry out the same process 
with an infinite collection of pairs of socks, because socks are not footed. We 
need a new axiom. In [244], Russell discusses this boot problem, though rather 
than selecting socks, he considers the case where “the left and right boots in each 
pair are indistinguishable.” Cruel shoes indeed! 

The axiom of choice guarantees the existence of a set of selected socks. The 
following version of the axiom is very close to that of Zermelo [293]. 


10. Axiom of choice (AC): If 7’ is a set whose elements are all sets that are 
nonempty and mutually disjoint, then UT’ contains at least one subset with 
exactly one element in common with each element of 7’. 


Most recent works use a formulation of the axiom of choice that asserts the 
existence of choice functions. In terms of socks, when a choice function is applied 
to a pair of socks, it outputs a designated sock. In the following statement, if T' is 
a set of pairs of socks, t would be a pair of socks, and f(t) would be a sock. 


10’. Axiom of choice (AC2): If T’ is a set of nonempty sets, then there is a 
function f such that for every t € T, f(t) € t. 


We use ZFC to denote ZF plus either version of AC. This is not imprecise, since 
we can prove that the two versions of the axiom of choice are interchangeable. 


Theorem 3.14. ZF proves that AC holds if and only if AC2 holds. 


Proof. First assume all the axioms of ZF plus AC. Let T’ be a set of nonempty 
sets. Define the function g with domain T by setting 


g(t) ={(t,y) ly € t} 


for each t € T. Essentially, g(t) looks like the set ¢ with a flag saying “I’m in t” 
attached to each element. By the replacement axiom, the set Y = {g(t) | t € T} 
exists. The elements of Y are nonempty and disjoint, so by AC there is a set S' that 
contains exactly one element from each element of Y. Thus S is a set of ordered 
pairs of the form (t, y), where exactly one pair is included for each t € T’. Let 
f(t) be the unique y such that (¢, y) € S. Then f is the desired choice function. 
To prove the converse, assume ZF plus AC2. Let T’ be a set whose elements are 
nonempty and disjoint. By AC2, there is a function f such that for each t € T, 
f(t) € t. By the replacement axiom, S = {f(t) | t € T} exists. S is the desired 
subset of UT. O 
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Zermelo ([291], [292]) used AC to prove that every set can be well-ordered. 
Hartogs [155] extended Zermelo’s result by proving that AC is equivalent to this 
well-ordering principle. Hartogs’ result is identical in format to the equivalence 
result that we just proved. What really makes Hartogs’ result and our equivalence 
theorem interesting is the fact that AC can neither be proved nor disproved in ZF. 
(Technically, we just implicitly assumed that ZF is consistent. I assure you that 
many people make much more bizarre assumptions in their daily lives.) Gédel 
[125] proved that ZF cannot disprove AC, and Cohen ([58], [59]) showed that ZF 
cannot prove AC. Thus our equivalence theorem and the theorem of Hartogs list 
statements that we can add interchangeably to strengthen ZF. In later sections we 
will see more examples of equivalence theorems and more examples of statements 
that strengthen ZF and ZFC. 


Exercises 


1. Prove that set containment is reflexive. That is, prove Va(a C «). (This 
requires only logical properties.) 


2. Prove that set containment is transitive. That is, prove 
VaVyVe((a2 CyAyC 2) 7 4C2). 
(This requires only logical properties.) 
3. Prove that the empty set is unique. That is, if Vr(x ¢ y), then y = 0). 
4. Prove that if y C {x}, then y = Q or y = {z}. 


5. Prove that U{x, y} is exactly the familiar set x U y. That is, prove that 
t € Ufa, y} if and only ift € v ort € y. 


6. Prove that Ud = 0. 


7. Find P(0), P(P()), and P(P(P(O))). (To make your answers look really 
bizarre and drive your instructor nuts, write { } for 0.) 


8. Prove that NX is unique. That is, show that if Y is a set that satisfies the 
formula Va(a € Y — Vi(t € X — x € t)) and Z isa set that satisfies the 
formula Va(a € Z @ Vt(t € X — «x € t)), then Y = Z. (Proving the 
existence of Y and Z requires the separation axiom (see Theorem 3.11), 
but this problem uses the axiom of extensionality.) 


9. Let X — Y denote the set{xr ¢ X |x ZY}. 


(a) Prove that for every X and Y, X — Y exists. 
(b) Prove that for every X and Y, X — Y is unique. 


(c) Under what circumstances does X — Y = Y — X? 
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10. Representations of ordered pairs. 


(a) Kuratowski [184] suggested that the ordered pair (a, b) can be repre- 
sented by the set {{a, b}, a}. (This encoding is still in use.) Using this 
definition, prove that (a,b) = (c, d) if and only if a = c and b = d. 


(b 


wm 


Using Kuratowski’s encoding, show that if X and Y are sets, then the 
set X x Y defined by X x Y = {(a,y)|ae X Ay € Y} exists and 
is uniquely determined by X and Y. 


(c 


wa 


Wiener [283] suggested that the ordered pair (x, y) can be represented 
by the set {{{x}, 0}, {{y}}}. If you dare, repeat parts 10a and 10b 
using this encoding. 

Show that encoding (a, b) by {a, {b}} leads to difficulties. (Find two 
distinct ordered pairs that have the same representation in this encod- 
ing.) 


(d 


wm 


11. Representations of n-tuples. 


(a) Usually, set theorists represent (a,b, c) by ((a, b), c), where pairs are 
represented using the Kuratowski encoding from Exercise 10. Using 
this representation prove the following: 

(i) (a,b,c) = (d,e, f) — (a=dAb=eAc=f), 
(ii) X x Y x Z exists, and 
(ii) X x Y x Z is unique. 


(b) To address type-theoretic concerns, Skolem [256] suggested repre- 
senting (a, b, c) by ((a, c), (b, c)). Repeat part 11a with this encoding. 


(c) Using parts 11a and 11b as the base cases in an induction argument, 
extend the statements in part 11a to n-tuples for each natural number 
n. Uf you do this, you clearly have a great love for long technical 
arguments. You might as well repeat the whole mess with the Wiener 
encoding.) 


Show that encoding (a, b,c) by {{a, b,c}, {a,b}, {a}} leads to diffi- 
culties. (You can find distinct triples with the same representation, or 
you can find an ordered pair that has the same representation as an 
ordered triple.) 


(d 


w 


12. Prove that for all X, P(X) ¢ X. (Hint: Suppose that for some X, P(X) C 
X. Define Y = {t € X |t ¢ t}. Show that Y € X and shop for a 
contradiction.) 


13. Let Z be the set provided by the infinity axiom. Let T be the set of subsets 
of Z that satisfy properties (i) and (ii) of the infinity axiom. Let Zp = NT. 


(a) Prove that T exists. (Hint: T C P(Z).) 
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(b) Prove that Zp exists. 


(c) Prove that if X satisfies properties (i) and (ii) of the infinity axiom, 
then Zp C X. 


14. Use the regularity axiom to prove that for all x and y either x ¢ y ory ¢ x. 


3.4 The Return of der Konig 


And Aragorn planted the new tree in the court by the fountain. 
—J.R.R. Tolkien, The Return of the King 


It may seem that the discussion of the last section strayed from our original topics 
in graph theory and combinatorics. However, AC is actually a statement about 
infinite systems of distinct representatives (SDR). As defined in Section 1.7.2, an 
SDR for a family of sets T' is a set that contains a distinct element from each set 
in 7’. For disjoint families, we have the following theorem. 


Theorem 3.15. ZF proves that the following are equivalent: 


I. AC. 


2. If T is a family of disjoint nonempty sets, then there is a set Y that is an 
SDR for T. 


Proof. First, assume ZF and AC and suppose T’ is a family of disjoint nonempty 
sets. By AC, there is a set Y C UT that has exactly one element in common with 
each element of 7’. Since the elements of T’ are disjoint, Y is an SDR for 7’. 

To prove the converse, suppose T’ is a family of disjoint nonempty sets. Let Y 
be an SDR for 7’. Then Y Cc UT’, and Y has exactly one element in common with 
each element of T’, as required by AC. O 


What if T’ is not disjoint? For finite families of sets, it is sufficient to know that 
every union of & sets has at least k elements. This is still necessary for infinite fam- 
ilies, but no longer sufficient. Consider the family of sets T = { Xo, X1, X2,...} 
defined by Xp = {1,2,3,...}, X1 = {1}, X2 = {2}, and so on. The union of 
any /; sets from T has at least k; elements. As a matter of fact, any collection of k 
sets from J’ has an SDR. However, the whole of JT’ has no SDR. To build an SDR 
for T’, we must pick some n as a representative for X9. This immediately leaves 
us with no element to represent X,,. We are out of luck. Note that if we chuck 
Xo, we can find an SDR for the remaining sets. (There are not many options for 
the representatives; it is hard to go wrong.) The infinite set X is the source of all 
our problems. If we allow only finite sets in the family, then we get a nice SDR 
existence theorem originally proved by Marshall Hall [145]. 


Theorem 3.16. Suppose T = {Xo, X1, X2,...} is a family of finite sets. T has 
an SDR if and only if for every k € N and every collection of k sets from T, the 
union of these sets contains at least k, elements. 
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Proof. Let T = {Xo, X1, X2,...} and suppose that each X; is finite. If T has 
an SDR, then for any collection of & sets, their representatives form a k element 
subset of their union. 

To prove the converse, assume that for every &: € N, the union of any & elements 
of T contains at least k elements. By Theorem 1.52, for each & the subfamily 
{Xo, X1,..., Xx} has an SDR. Let Y be the tree whose paths are of the form 
r,X0,%1,---,Xz, where x; € X; fori < k and {a9,21,...,x7,%} is an SDR for 
{Xo, X1,..., Xx}. Since arbitrarily large finite subfamilies of T have SDRs, the 
tree Y is infinite. Furthermore, the size of the kth level of the tree Y is at most 
|Xo| - |Xi|---|X;]|, where |X;| denotes the size of X;. Since these sets are all 
finite, each level is finite. By K6nig’s Lemma, Y has an infinite path, and that 
path is an SDR for T. O 


In the preceding proof we made no immediately obvious use of AC. Here is a 
question: Have we actually avoided the use of AC, or did we merely disguise it? 
The answer is that we have used some of the strength of AC in a disguised form. 

There are two very natural ways to restrict AC. Recall that AC considers a 
family of sets. We can either restrict the size of the sets or restrict the size of the 
family. If we require that each set is finite, we get the following statement. 


Axiom of choice for finite sets (ACF): If T is a family of finite, nonempty, 
mutually disjoint sets, then UT’ contains at least one subset having exactly 
one element in common with each element of T’. 


If we specify that the family can be enumerated, we get the following state- 
ment. (We say that an infinite set is countable if it can be written in the form 
{2 Lig Hayat) 


Countable axiom of choice (CAC): If T = {Xo, X1, Xo,...} is a family 
of nonempty, mutually disjoint sets, then UT’ contains at least one subset 
having exactly one element in common with each element of T’. 


Combining both restrictions gives us CACF, the countable axiom of choice for 
finite sets. The statement of CACF looks like CAC with the added hypothesis that 
each X; is finite. This weak version of AC is exactly what we used in proving 
Theorem 3.16. 


Theorem 3.17. ZF proves that the following are equivalent: 
I. Kénig’s Lemma. 
2. Theorem 3.16. 
3. CACF. 


Proof. The proof of Theorem 3.16 shows | implies 2. The proofs that 2 implies 3 
and 3 implies | are Exercises | and 2. O 
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The relationships between our various versions of choice are very interest- 
ing. It is easy to see that ZF proves AC—-CAC, AC->ACF, CAC->CACF, and 
ACF—CACF. It is not at all obvious, but can be shown, that not a single one of 
the converses of these implications is provable in ZF, and also that CACF is not a 
theorem of ZF. To prove that ZF cannot prove these statements we would assume 
that ZF is consistent and build special models where each particular statement 
fails. The models are obtained by lifting results from permutation models or by 
forcing. Jech’s book The Axiom of Choice [167] is an excellent reference. 

Since ZF proves that Konig’s Lemma is equivalent to CACF and CACF is not 
a theorem of ZF, we know that K6nig’s Lemma is not a theorem of ZF. Of course, 
ZFC can prove Konig’s Lemma, so it is perfectly reasonable to think of it as a 
theorem of mathematics. Also, ZF can prove some restrictions of K6nig’s Lemma, 
for example if all the labels in the tree are natural numbers. Many applications of 
K6nig’s Lemma can be carried out with a restricted version. 

Our proof closely ties Kénig’s Lemma to countable families of sets. As we 
will see in the next section, not all families are countable. We will see bigger sets 
where the lemma fails, and still bigger sets where it holds again. This is not the 
last return of K6nig. (KOnig means “king” in German.) 

In the introduction to this chapter we noted that if P is finite, then whenever 
HT is a proper subset of P there is no matching between P and H. A set X is 
called Dedekind finite if no proper subset of X can be matched with X. Thus, 
the introduction shows that if X is finite, then X is Dedekind finite. Exercise 4 
shows that CAC implies the converse. Thus, in ZFC, the finite sets are exactly 
the Dedekind finite sets. This characterization of the finite sets requires use of a 
statement that is weaker than CAC, but not provable in ZF [167]. 


Exercises 


1. Prove in ZF that Theorem 3.16 implies CACF. (Hint: Use disjointness to 
show that the union of any k sets contains at least / elements.) 


2. Challenging exercise. Prove K6nig’s Lemma using ZF and CACF. To do 
this, let S be the set of nodes in the tree that have infinitely many successors. 
Find an enumeration for S’. (It is easy to slip up and use full AC when 
finding the enumeration.) For each s € S, let X, be the set of immediate 
successors of s that have infinitely many successors. Apply CACF to the 
family {X, | « € S}. Use the selected vertices to construct a path through 
the tree. 


3. Disjointification trick. Suppose that {X,, | n © N} is a family of sets. For 
each n € N let X,, = {(n, x) | c € X,}. Show that {X,, | n € N} exists 
and is a disjoint family of sets. 


4. Use CAC to prove that every infinite set has a countable subset. (Hint: Sup- 
pose that W is infinite. For each k € N let W;, be the set of all subsets of W 
of size k. Apply CAC to a disjointified version of the family {W; | k € N}. 
Show that the union of the selected elements is a countable subset of W.) 
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5. Assume that every infinite set has a countable subset. Prove that if X cannot 
be matched with any proper subset of itself, then X is finite. (Hint: Suppose 
X is infinite and use a countable subset of X to find a matching between X 
and a proper subset of X.) 


3.5 Ordinals, Cardinals, and Many Pigeons 


Whenever Gutei Oshé was asked about Zen, he simply raised his 
finger. Once a visitor asked Gutei’s boy attendant, “What does your 
master teach?” The boy too raised his finger. Hearing of this, Gutei 
cut off the boy’s finger with a knife. The boy, screaming with pain, 
began to run away. Gutei called to him, and when he turned around, 
Gutei raised his finger. The boy suddenly became enlightened. 

— Mumon Ekai, The Gateless Gate 


The previous section contains some references to infinite sets of different sizes. 
To make sense of this we need to know what it means for sets to be the same 
size. We can illustrate two approaches by considering some familiar sets. Thanks 
to the gentleness of my religious training, I have the same number of fingers 
on my left and right hands. This can be verified in two ways. I can count the 
fingers on my left hand, count the fingers on my right hand, and check that the 
results match. Note that in the process of counting, I am matching fingers with 
elements of a canonical ordered set, probably {1, 2,3, 4,5}. By emphasizing the 
matching process, I can verify the equinumerousness of my fingers without using 
any canonical set middleman. To do this, I match left thumb with right thumb, left 
forefinger with right forefinger, and so on. When my pinkies match, I know that 
I have the same number of fingers on my left and right hands. One advantage of 
this technique is that it works without modification for people with six or more 
fingers on each hand. 

For infinite sets, either method works well. We will start by comparing sets 
directly, then study some canonical ordered sets, and finish the section off with 
some applications to pigeons and trees. 


3.5.1 Cardinality 


The big one! 
— Connie Conehead 


Suppose we write X x Y if there is a one-to-one function from X into Y, and 
X ~ Y if there is a one-to-one function from X onto Y. Thus X ~ Y means 
that there is a matching between X and Y. If X x Y and X ~ Y,so X can be 
embedded into but not onto Y, we will write _X ~< Y. With this notation, we can 
describe relative sizes of some infinite sets. 
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First consider the sets N = {0,1,2,...} and Z = {...,—2,—1,0,1,2,...}. 
Define the function f : N — Z by 


2n+1 ae 
Zz 


fn) = (ayn (7 


It is not hard to verify that if f(j) = f(k), then j = k, proving that f is a one- 
to-one function. Additionally, if m > 0, then f(2m — 1) = m, and if t < 0, then 
f(—2t) = t, so f maps the odd natural numbers onto the positive integers and 
the even natural numbers onto the negative integers and 0. Thus, f witnesses that 
N ~ Z, and we now know that N and Z are the same size. 

If X is a set satisfying N ~ X, then we say that X is countable (or countably 
infinite if we are being very precise.) We just prove that Z is countable. Not every 
infinite set is countable, as shown by the following theorem of Cantor. 


Theorem 3.18 (Cantor’s Theorem). For any set X, X ~< P(X). In particular, 
N ~ P(N). 


Proof. Define f : X — P(X) by setting f(t) = {t} for each t € X. Since f 
is one-to-one, it witnesses that XY < P(X). It remains to show that X ~ P(X). 
Suppose g : X — P(X) is any one-to-one function. We will show that g is not 
onto. Let y = {t € X | t ¢ g(t)}. Suppose by way of contradiction that for some 
x € X, g(x) = y. Because g(x) = y, x € g(x) if and only if x € y, and by the 
definition of y, x € y if and only if « ¢ g(x). Concatenating, we get x € g(x) 
if and only if « ¢ g(x), a clear contradiction. Thus y is not in the range of g, 
completing the proof. Oo 


One consequence of Cantor’s Theorem is that any function from P(N) into N 
must not be one-to-one. More combinatorially stated, if we try to ram a pigeon 
for each element of P(N) into pigeonholes corresponding to the elements of N, 
some pigeonhole must contain at least two pigeons. Another consequence is that 
by sequentially applying Cantor’s Theorem to an infinite set, we get lots of infinite 
sets, including some very big ones. 


Corollary 3.19. There are infinitely many infinite sets of different sizes. 
Proof. N is infinite, and N < P(N) < P(P(N))--- by Cantor’s Theorem. O 


Using only the definition of ~ and chasing some functions, we can prove that 
~ is an equivalence relation. (See Exercise 2.) In particular, for any sets A, B, 
and C’, we have 


e AWA, 
e if A~ B then B ~ A, and 


eifA~ BandB~C,thenA~C. 
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These are handy shortcuts, and it would be nice to have analogous statements for 
the < relation. We can easily show that A x A and that if A X Band B x C, 
then A x C. (See Exercise 3.) Symmetry does not hold for the < relation, but 
we can prove that if A x Band B x Athen A ~ B. This last statement is the 
Cantor—Bernstein Theorem, an incredibly handy shortcut for showing that sets are 
the same size. After discussing the proof and history of the theorem, we will look 
at some nice applications. 


Theorem 3.20 (Cantor—Bernstein Theorem). [f both X x Y and Y x X, then 
X~Y., 


Proof. Suppose f : A — Band g: B — A are one-to-one functions. We 
will sketch the construction of a function h : A — B that is one-to-one and 
onto. Define a set of subsets of A as follows. Let Ag = A, A, = g(B), and 
An = g(f(An—2)) for n > 2. In particular, writing g o f(A) for g( f(A)), we 
have 


Ao = A, 
A, = g(B), 
Az =g0 f(A), 


A3 =gofog(B), 
Ay = 0 fogo f(A), and 
As =gofogofog(B). 
Note that A,, is defined with n function applications. It goes “back and forth” n 


times. Using induction as described in Exercise 4a, it is fairly easy to prove the 
following claim. 


Claim 1: For all n, An D An41. 


Given Claim 1, define the sets A’, = Ap, — An+1 for each n. Also define the set 
Al, = MnenAn. These sets form a partition of A into disjoint pieces, as claimed 
below. Hints for the proof of this claim appear in Exercise 4b. 


Claim 2: For every x € A there is a unique n € {w,0,1,2,...} such that 
Ge Ay. 


Define the function h : A — B by the following formula: 


at = fe). abe GUA ADAG sacs 
| g(a) ifa € UL Aj, AS, Ab... }. 


By Claim 2, h(x) is well-defined and has all of A as its domain. It remains to show 
that h(a) is one-to-one and onto. This can be accomplished by defining Bo = B, 
and B,+41 = f(An) for each n > 0. Imitating our work with the A,,, in Exercise 
4c we prove the following. 
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Claim 3: For all n, B, D B,+1. Furthermore, if we define the prime sets 
Bi, = By — Bn41 and B’., = NnenBn, then for every y € B, there is a 
unique n € {w,0,1,2,...} such that y € Bi. 


The partitions of A and B are closely related. In particular, Exercise 4d gives hints 
for proving the following claim. 


Claim 4: For each n € N, h(Aj,,) = Bon41 and h(Aj,,,1) = Boy. Also, 
NA = Bo. 


Since h matches the A’ pieces with the B’ pieces and is one-to-one and onto on 
these pieces, / is the desired one-to-one and onto function. O 


One indication of the importance of the Cantor—Bernstein Theorem is the num- 
ber of mathematicians who have produced proofs of it. The following is a partial 
listing. According to Levy [187], Dedekind proved the theorem in 1887. Writing 
in 1895, Cantor [47] described the theorem as an easy consequence of a version 
of the axiom of choice. In the endnotes of [49], Jourdain refers to an 1896 proof 
by Schréder. Some texts, [209] for example, refer to the theorem as the Schréder— 
Bernstein Theorem. Bernstein proved the theorem without using the axiom of 
choice in 1898; this proof appears in Borel’s book [32]. Additional later proofs 
were published by Peano [220], J. K6nig [181], and Zermelo [292]. It is good 
to remember that the axioms for set theory were in flux during this period. These 
mathematicians were making sure that this very applicable theorem was supported 
by the axioms du jour. 

Now we will examine a pair of applications of the Cantor—Bernstein Theorem. 
Note that we are freed of the tedium of constructing onto maps. 


Corollary 3.21. N ~ Z. 


Proof. Define f : N > Z by f(n) = n. Note that f is one-to-one, so N X Z. 
Define g : Z — N by 


22 if z > 0, 
g(2) = 
2\z|+1 ifz<0. 


Note that g is one-to-one, so Z < N. Applying the Cantor—Bernstein Theorem, 
N~Z. oO 


Corollary 3.22. N~N x N. 
Proof. The function f : N > N x N defined by f(n) = (0,7) is one-to-one, so 
N x NXN. The function g : NxN — N defined by g(m, n) = 2”*!-3”*1 is also 


one-to-one, so N x N x N. By the Cantor—Bernstein Theorem, N~NxN. O 


Corollary 3.23. P(N) ~ R. Consequently, R is uncountable. 
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Proof. First we will prove that P(N) x R. Define f : P(N) — R by setting 
f(X) = Mnex 107” for each X € P(N). As an example of how this map 
works, if X consists of the odd natural numbers, then f(X) = 0.1010. If X and 
Y are distinct subsets of N, then they differ at some least natural number n, and 
f(X) and f(Y) will differ in the nth decimal place. Thus, f is one-to-one, and so 
P(N) XR. 

Now we must construct a one-to-one function g : R — P(N). Let r be a real 
number. If we avoid decimal expansions that terminate in an infinite sequence of 
9’s, we can assume that r has a unique decimal expansion of the form 


(-1)° {| So kilo’ + S© ajlo~? |, 


tEXy JEXe2 


where « € {0,1}, each k; and d; is between 1 and 9, X, is a set of natural 
numbers, and X49 is a set of nonzero natural numbers. In this representation, (—1)¢ 
is the sign of r, a k,,10* is the integer portion of r, and 5> d;10~/ is the 
fractional portion of r. Define the function g by setting 


JEX2 


g(r) = {e} U (1041 +k; | de X1}U {107 + dj | j € Xo} 


for each r € R. As a concrete example of the behavior of this map, consider 
g(—12.305) = {1, 1001, 12, 103, 1000005}. Since different reals differ in some 
decimal place, g is one-to-one. By the Cantor—Bernstein Theorem, P(N) ~ R. 
By Theorem 3.18, N < P(N). Together with P(N) ~ R, this implies that 
N & R, so R is uncountable. O 


Note that we did not construct a one-to-one function from P(N) onto R. The 
Cantor—Bernstein Theorem tells us that such a function must exist, so we are not 
obligated to construct it. (If you are not already convinced that existence theorems 
are tremendously convenient, try doing a direct construction for the preceding 
corollary. This is intentionally not listed in the exercises.) 


3.5.2. Ordinals and Cardinals 


The aleph was heavy, like trying to carry a small engine block. 
— William Gibson, Mona Lisa Overdrive 


For Gibson, an aleph is a huge biochip of virtually infinite storage capacity. For 
a linguist, aleph is X, the first letter of the Hebrew alphabet. For a set theorist, an 
aleph is a cardinal number. Saying that there are Xo natural numbers is like saying 
that there are five fingers on my right hand. Alephs are special sorts of ordinals, 
and ordinals are special sorts of well-ordered sets. 

Suppose that X is set and < is an ordering relation on X. We write x < y 
when z < y and x F y. The relation < is a linear ordering on X if the following 
properties hold for all x, y, and z in X. 
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Antisymmetry: («<y A y<a)> a=y. 
Transitivity: 2 <y—- (y<z—>a4<2z). 
Trichotomy:r<y V@=yVy<@. 


Familiar examples of linear orderings include N, Z, Q, and R with the typical 
orderings. We say that a linear ordering is a well-ordering if every nonempty 
subset has a least element. Since every subset of N has a least element, N is a 
well-ordering (using the usual ordering). Since the open interval (0,1) has no 
least element, the usual ordering does not well-order R. An analyst would say 
that 0 is the greatest lower bound of (0,1), but 0 is not the least element of (0, 1) 
because 0 ¢ (0,1). The following theorem gives a handy characterization of well- 
ordered sets. 


Theorem 3.24. Suppose X with < is a linearly ordered set. X is well-ordered if 
and only if X contains no infinite descending sequences. 


Proof. We will prove the contrapositive version, that is, X is not well-ordered if 
and only if X contains an infinite descending sequence. 

First suppose X is not well-ordered. Then X has a nonempty subset Y with 
no least element. Pick an element xo in Y. Since 2g is not the least element of 
Y, there is an element x; in Y such that xq > x1. Continuing in this fashion, we 


obtain x9 > 41 > X2 > ---, an infinite descending sequence. 

To prove the converse, suppose X contains tg > 21 > ®2 > ---, an infinite 
descending sequence. Then the set Y = {x; | i ¢ N} is a nonempty subset of X 
with no least element. Thus, X is not well-ordered. O 


For any set X, the € relation defines an ordering on X. To see this, for each 
z,yEe X,letx <e yifx € yorx = y. In general, this is not a particularly pretty 
ordering. For example, if a 4 b then the set X = {a,b, {a,b}} is not linearly 
ordered by the <¢ relation. On the other hand, Y = {a, {a,b}, {a, {a, b} }} is 
well-ordered by the <¢ relation. In a moment, we will use this property as part of 
the definition of an ordinal number. 

A set X is transitive if for ally € X,if x € y then x € X. A transitive set that 
is well-ordered by <¢ is called an ordinal. The ordinals have some interesting 
propertties. 


Theorem 3.25. Suppose X is a set of ordinals and a and (3 are ordinals. Then 
the following hold: 


I. UX is an ordinal. 
2. aU {a} is an ordinal. 
3.a€ Bora=BorBea. 


Proof. See Exercises 10, 13, 14, 15, and 16. O 
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The first two properties in the preceding theorem give good ways to build new 
ordinals from old ones. For example, a little vacuous reasoning shows that () is an 
ordinal. By the theorem, the sets ) U {0} = {0}, {0} U {{O}} = {0, {O}}, and 
{9, {O}} U {{O, {O}}} = {0, {0}, {0, {O}}} are all ordinals. Set theorists have 


special names for these finite ordinals. They write 
0 =0, 
{0} = {0} = 1, 
{0, {0} } = {0,1} =2, 
{0, {0}, {0, {O}}} = 10,1, 2} = 8, 


and so on for all & € N. Since each k is a set, we can define w by w = Ugenk, and 
use the first property in the theorem to see that w is an ordinal. We do not have to 
stop here. Since w is an ordinal, so is w U {w} = {w,0,1,2,...}, and we start all 
over. Sometimes, texts write a + 1 for the ordinal a U {a} and call ordinals like 
1, 2, 3, and w + 1 successor ordinals. Ordinals that are not successors are called 
limit ordinals. The set w is a good example of a limit ordinal. 

Traditionally, greek letters are used to denote ordinals. Also, we usually write 
a < (rather than a <¢ (3. Consequently, for ordinals the formula a < ( means 
the same thing as a € (3. Because ordinals are transitive, a € 3 implies a C £3, 
although the converse is not always true. 

There are three ways to think about ordinals and well-orderings. First, every 
ordinal is a well-ordered set under the < relation. Second, the class of all ordinals 
is well-ordered by the < relation. Third, every well-ordered set looks just like an 
ordinal. The next theorem is a precise expression of the way that ordinals act as 
canonical well-orderings. 


Theorem 3.26. Every nonempty well-ordered set is order isomorphic to an ordi- 
nal. That is, if X is well-ordered by <, then there is an ordinal a and a function 
h: X — a such that h is one-to-one and onto, and for all x and yin X, «x < y 
implies h(x) < h(y). 


Proof. Let X bea well-ordered set. For each x € X, define the initial segment for 
x by I, = {t © X | t < x}. Let W be the subset of X consisting of all elements 
x such that J,, is order isomorphic to an ordinal. Note that for each x € W, I, is 
order isomorphic to a unique ordinal. By the replacement axiom, we can construct 
a set A of all the ordinals isomorphic to initial segments of X. Let a = UA; by 
Theorem 3.25, a is an ordinal. If x,y € W, x < y, y and 6 are ordinals, and h, 
and h, are order isomorphisms such that hz : I; — 7 and h, : I, — 6, then for 
allt < x, h,(t) = h(t). Using the replacement axiom, we can form the set of 
all the order isomorphisms corresponding to the initial segments, and concatenate 
them to build a new function h. This new function is an order isomorphism from 
W onto a. To complete the proof, we claim that W = X. Suppose not; since 
X is well-ordered, we can find a least ¢ in X such that t ¢ W. If we extend h 
by setting h(t) = a, then h witnesses that J, is order isomorphic to a + 1. Thus 
t € W, yielding a contradiction and completing the proof. O 
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The next theorem shows that using AC we can well-order any set, widening the 
applicability of the preceding theorem. Our proof of the “well-ordering principle” 
uses ideas from Zermelo’s [291] original proof. The proof can also be viewed as 
a special case of Zorn’s Lemma. See Exercise 18 for more about Zorn’s Lemma. 


Theorem 3.27. Every set can be well-ordered. 


Proof. Let X be a set. We will construct a one-to-one map h : a — X from an 
ordinal a onto X. This suffices to prove the theorem, since the elements of a are 
well-ordered and h matches elements of a with elements of X. 

By AC we can pick x € X — t for each nonempty t C X. There are two 
things to note here. First, x; is never an element of t. This is important later in the 
proof. Second, this is the only use of AC in this entire proof. This is handy for the 
exercises. 

Suppose that f : a — Y is a one-to-one map of an ordinal a ontoaset Y C X. 
For each 3 < a, let f[G] denote {f(5) | 6 € 3}. (Remember, since (@ and a are 
ordinals, 3 < a is the same thing as 3 € a.) We say that f is a y-function if 
f(B) = x Fig) for every 3 € a. Let T be the set of all y-functions. The y-functions 
cohere nicely; if f and g are y-functions and {3 is in both of their domains, then 
f(B) = g(G). (See Exercise 17.) If we view the functions in I as sets of ordered 
pairs, UL is also a set of ordered pairs. Since the functions cohere and are one-to- 
one, UT is actually a one-to-one function; call it h. By Theorem 3.25, the union of 
the ordinals that are domains of the functions in I’ is also an ordinal, so for some 
ordinal a, h : a + X. Furthermore, h is a y-function. 

It gets better. Suppose that h does not map a onto X, so h[a] © X. Then we 
can define an extension h’ by setting h’((3) = h(3) for 8 < wand h'(a) = &pjaj- 
This extension h’ is also a 7-function, so h’ € I’. Applying the definition of h, 
we find that h’(q) is in the range of h. But h’(a@) = xp;q) and the range of h is 
h{a], so we have pio) € h[a], contradicting the statement two paragraphs back. 
Summarizing, h is a one-to-one map of a onto X, so X is well-ordered. oO 


Combining the last two theorems yields the following corollary. 


Corollary 3.28. For every X there is a unique least ordinal a such that X ~ a. 


Proof. Fix X.By Theorem 3.27, X can be well-ordered. By Theorem 3.26, X ~ 
@ for some ordinal 3. Let A = {y < 8| 7 ~ X} be the set of ordinals less than 
or equal to ( that are equinumerous with X. Since A is a nonempty set of ordinals 
well-ordered by <, A contains a least element, a. Let 6 be any ordinal such that 
X ~ 6. By Theorem 3.25, 6 < aora < 6. If 6 < a, then dé < ( and we have 
6 € A, contradicting the minimality of a. Thus a < 6, and qa is unique. Oo 


Since every set has a unique least equinumerous ordinal, we can define |X| as 
the least ordinal a such that X ~ a. We say that an ordinal « is a cardinal number 
if || = «. In slogan form, a cardinal number is the least ordinal of its cardinality. 
The finite pigeonhole principle asserts that every finite ordinal is a cardinal. Thus, 
0, 1, and 17324 are all cardinals. The infinite pigeonhole principle shows that w 
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cannot be mapped one-to-one into any finite cardinal, so w is a cardinal number; 
indeed, it is the least infinite cardinal. On the other hand, w+1 ~ wandw+2 ~ w, 
so w+1 and w+2 are not cardinals. The elements of the next larger cardinal cannot 
be matched with the elements of w, so the next larger cardinal is uncountable. 

Even though every cardinal number is an ordinal, we have special notation to 
distinguish the cardinals. When we are thinking of w as a cardinal, we denote 
it with an aleph, so w = No. The next larger (and consequently uncountable) 
cardinal is &;. Proceeding in this way, and using unions at limit ordinals, we can 
define &. for every ordinal number qa. For example, the least cardinal bigger than 
No, Ni, No,... is X,. Assuming AC, for every infinite set X, there is an ordinal a 
such that |X| = Na. 

The ordinals are like a long string of beads. The infinite cardinals, which are the 
alephs, appear like infrequent pearls along the string. The ordinals are good for 
counting steps in order (like rosary beads), and the cardinals are ideal for summing 
up sizes (like abacus beads). For finite sets, cardinals and ordinals are identical. 
Thus |{A, B,C, D}| = 4 = {0,1,2,3} and {A,B,C,D} ~ {0,1,2,3}. In 
general, the matching approach to measuring the sizes of sets agrees with the 
cardinality approach. This is formalized in the following theorem. 


Theorem 3.29. For all sets X and Y, |X| = |Y| ifand only ifX ~Y. 


Proof. Suppose |X| = |Y| = «. Then X ~ wandY ~ «,so X ~ Y. Conversely, 
suppose X ~ Y, and let «1 = |X| and kg = |Y|. Since kK) ~ X ~ Y ~ ko, we 
have K1 ~ Kg. Since &; and «2 are cardinals, Kk; ~ K2 implies Kk; = Kg. Thus 
|X| = |Y]. O 


3.5.3 Pigeons Finished Off 


Every Sunday you'll see 
My sweetheart and me, 
As we poison the pigeons in the park. 
— Tom Lehrer 


At this point, we know quite a bit about stuffing pigeons into pigeonholes. For 
example, if p and h are both finite cardinal numbers, we have the following finite 
pigeonhole principle. 


e If we put p pigeons into h pigeonholes and h < p, then some pigeonhole 
contains at least two pigeons. 


The idea here is that any function from the set of larger cardinality into the set of 
smaller cardinality must fail to be one-to-one. By the Cantor—Bernstein Theorem, 
this holds for infinite cardinals as well. Thus for any cardinals « and 4, we get the 
following analogue of the finite pigeonhole principle. 


e If we put & pigeons into \ pigeonholes and \ < «, then some pigeonhole 
contains at least two pigeons. 
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The preceding infinite analogue of the finite pigeonhole principle is not the 
same as the infinite pigeonhole principle of Theorem 3.1. Here is a restatement of 
Theorem 3.1 using our notation for cardinals. 


e If we put No pigeons into h pigeonholes and h < No, then some pigeonhole 
contains No pigeons. 


The infinite pigeonhole principle says that some pigeonhole is infinitely crowded. 
This does not transfer directly to higher cardinalities. For example, we can put X., 
pigeons into No pigeonholes is such a way that every pigeonhole has fewer than 
XN. pigeons in it. To do this, put No pigeons in the Oth hole, Xi pigeons in the 
1st hole, Xz pigeons in the 2nd hole, and so on. The total number of pigeons is 
Nw = UnewNn, but each hole contains &,, pigeons for some n < w. This peculiar 
behavior stems from the singular nature of &,,. 

A cardinal « is called singular if there is a cardinal A < « and a function 
f :A— « such that Une) f(a) = &. (Remember, « is transitive, so if f(a) € K, 
then f(a) C «.) As an example, if we define f : No > wy by f(n) = Nn, then 
Uaery f(a) = Xu, Showing that X,, is singular. Any infinite cardinal number that 
is not singular is called regular. One good example of a regular cardinal is No; it 
is not equal to any finite union of finite cardinals. We can generalize the infinite 
pigeonhole principle for regular cardinals, but to prove the new result, we will 
need the following theorem. 


Theorem 3.30. For every infinite cardinal k, 


KX Kl =k. 


We will postpone the proof of Theorem 3.30 for a while and jump right to the 
avian corollary. Since this is the last pigeonhole principle in this book, we will 
call it ultimate. Of course our list of pigeonhole principles is not all inclusive. For 
example, more set theoretic pigeonhole principles are given in [72]. 


Corollary 3.31 (Ultimate Pigeonhole Principle). The following are equivalent: 
1. «is a regular cardinal. 


2. If we put & pigeons into X < k pigeonholes, then some pigeonhole must 
contain k, pigeons. 


Proof. First suppose that \ < « and «& is regular. Suppose that g : & — is an 
assignment of « pigeons to \ pigeonholes. Define f(a) = |{x € « | g(x) = a}| 
for each a < A, so f(a) is the population of the ath pigeonhole. Suppose, by 
way of contradiction, that f(a) < « for each a. Because each f(a) is a cardinal, 
ft = Uae f(a) is a cardinal. Furthermore, p. < « because « is regular. For each 
a, f(a) < py, so the population of the ath pigeonhole can be matched with a 
subset of jz. Since there are \ pigeonholes, the entire pigeon population can be 
matched with a subset of 1 x A, sow < |x Al. Lety = max{y, A}. Since pp < v 
and A < y, |u x A| < |v x v|. By Theorem 3.30, |v x v| = v. Since wu < « and 
A < K, we have v < «, and concatenating inequalities yields 


K<|uxaAl<|luxvl=v<k, 
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a contradiction. Thus, for some a, f(a) = « and the ath pigeonhole contains « 
birds. 

To prove the converse, suppose that « is a singular cardinal. Then there is a 
cardinal A < « and a function f : \ — «# such that Uge, f(a) = «. Define 
g:«& — Aby letting g(Z) be the least a such that 3 € f(a). Since Us<a, f(a) =k 
and A is well-ordered, g is well-defined and maps each element of « to an element 
of \. Furthermore, for each a < A, 


{8 € «| g(@) = a}| <|f(a)| < x. 


Thus g can be viewed as an assignment of « pigeons to \ pigeonholes so that the 
population of each pigeonhole is less than «. Oo 


The first part of the preceding proof can be adapted to prove that lots of cardi- 
nals are regular. This is a nice fact, since it means that we can apply the pigeonhole 
principle in lots of situations. 


Corollary 3.32. For each ordinal a, the cardinal Xq+1 is regular. 


Proof. We will sketch the argument. Suppose f : A4 — Na+i1, where 2 is any 
cardinal such that \ < Na4i. Then A < Na, and for each 3 < A, |f(G)| < Xa. 
Applying Theorem 3.30 yields | Uv<ey f(a@)| < |Na X Nal = Na < Nay. O 


We should list the regular cardinals we have found. No is regular, and by the 
preceding corollary so are Xy(= o+1), Ne(= N141), X3, Na, and so on. We have 
seen that the limit cardinal X,, is singular; the subscript cannot be written as a+1, 
so this does not contradict Corollary 3.32. However, X.,+1 is regular, as are Xw+2, 
XN.w+3, and so on. Our only good example of a regular limit cardinal is No. We do 
not have an example of an uncountable regular limit cardinal. The reason for this 
is explained in Section 3.6. 

It seems that Theorem 3.30 is a handy way to bound the sizes of unions. Here 
is a nice way to capsulize that. 


Corollary 3.33. [f « is an infinite cardinal and |X| < « for each a < k, then 
| Ua<w Xal < #. In particular, a countable or finite union of at most countable 
sets is at most countable. 


Proof. Suppose |X| < « for each a < x. For each a, let ga : Xq — & be a one- 
to-one map. Define f : UgenXa > KX K by f(x) = (a, ga(x)), where a is the 
least ordinal such that x € X,,. The function f is one-to-one, so Ugexn.Xaq SB KXK. 
Thus by Theorem 3.30, | Ua<x Xa| < |& X &| = K. To prove the particular case, 
let K = No. O 


We have used Theorem 3.30 repeatedly, but still have not proved it. It is time 
to pay the piper. 


Proof of Theorem 3.30. We will use induction to prove that |« x «| = « for every 
infinite cardinal «. For the base case, apply Corollary 3.22 to get |No x No| = No. 
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As the induction hypothesis, assume | x A] = A for every infinite cardinal \ < k. 
Since & X Kk X &, by the Cantor—Bernstein Theorem, it suffices to show that 
KXAN K. 

Define the ordering < on & x #« as follows. Let (a, 3), (a’, 6’) € & x «and let 
pe = max{a, 3} and yw’ = max{a’, 3’}. We say that (a, 3) < (a’, 3’) if and only 
if 


pe <p’, or 
p= p! anda <a’, or 
w= pw anda=a’' andB < pf’. 


Informally, this relation sorts « x & by looking at maxima, then first elements, 
and then second elements. A routine but technical argument shows that < is a 
well-ordering of & x «. By Theorem 3.26, & x « under < is order isomorphic 
to some ordinal. Let 6 denote that ordinal, and let f : & x « — 6 be the order 
isomorphism. If 6 < «, then k x «& S 6 XS « and the proof is complete. 

Suppose by way of contradiction that « < 6. Since 6 is an ordinal, we know that 
& € 0, so there is an element (a,7) € & x « such that f(o,7) = «. Let p denote 
max{o,T} and note thato < &,7 < «, and consequently 1 < «. Furthermore, 
by definition of the well-ordering on & x k, 


{(a,B) EKxXK 


sok Nw x pandsr < |p x pl. Let A = |p|. Since X ~ ps, we have |u x p| = 
|X x Aj. Since zp < «, A is a cardinal less than «, so by the induction hypothesis, 
|A x A| = A < k. Concatenating inequalities yields K < | x | = |A x A] < &, 
a contradiction that completes the proof. 


f(a, 8) <K}C ux p, 


Exercises 
1. Define f : N— Z by f(n) = (—1)"**(4)(2n + 1) + (4). 
(a) Show that f is one-to-one. (Assume f(j) = f(k) and prove j = k.) 


(b) Show that f is onto. (Show that ifm > 0, then f(2m — 1) = mand 
if t < 0 then f(—2t) = t.) 


2. Prove that ~ is an equivalence relation. That is, show that for all sets A, B 
and C’, the following hold: 


(a) AW A, 
(b) AW B= B~ A, and 
(c) (AN BAB~C)> AC. 


3. Show that for all sets A, B, and C, the following hold: 


(a) A x A (so J is reflexive), and 
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4. 


10. 


11. 


12. 
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(b) (AX BABXC) > AX C (80 Nis transitive). 


Details of the Cantor—Bernstein proof. This problem uses notation from the 
proof of Theorem 3.20. 


(a) Use induction to prove Claim 1. As a base case, show Ag D Ay D Ag. 
For the induction step, assume A, D Any, D An +2 and show that 
An+2 D An+3, using the fact that A, > A,+1 implies go f(A,) D 
9° f(An+1). 

Prove Claim 2. Use the fact that either 2 € A‘, or there is a least 7 
such that « ¢ A, to show that each x is in some A/,. (Here n is a 
natural number or w.) To prove that x is in a unique A’, suppose that 
x is in two such sets, and seek a contradiction. 


Prove Claim 3. Use Claim | to get a short proof that B,, > B,4+1. The 
remainder of the argument parallels the proof of Claim 2. 


(b 


wm 


(c 


wm 


(d 


wm 


Prove Claim 4. To show that h(A5,,) = B3,,,1, note that because f is 
one-to-one we have f(A2,) — f(Agn—1) = f(Aan — Agn—1), and so 


Bias = Bon+41—Ban = f (Aan)—f(Aan-1) = f (Adan) — h(Ap,): 


The proof that B5,, = h(A5,,,,) is similar. For the limit, the proof 
of h(Al,) = BI, relies on the fact that since f is one-to-one, we must 
have that f(QnewAn) = Onewf (An). 


. Let Q denote the set of rationals. Prove that Q ~ N. 


. Let Seq denote the set of all finite sequences of natural numbers. Prove that 


Seq ~N. 


. Without using Theorem 3.24, prove that Z with the usual ordering is not 


well-ordered. 


. Using Theorem 3.24, prove that Z with the usual ordering is not well- 


ordered. 


. Repeat Exercises 7 and 8 for the set Qt = Q/N [0,co) with the usual 


ordering. 
Suppose that X is a transitive set. 


(a) Prove that X’ = X U {X} is transitive. 
(b) Prove that UX is transitive. 


Give an example of a nontransitive set where <¢ is a transitive relation. 
(Hint: The relation <¢ is vacuously transitive on every two-element set.) 


Give an example of a transitive set where <¢ is not a transitive relation. 
(Hint: There is an example with three elements.) 


13. 


14. 


15; 


16. 
17. 


18. 
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Prove that if a is an ordinal, then so is a’ = a U {a}. (Hint: Exercise 10a 
shows that a’ is transitive. Use the fact that a is well-ordered by <¢ to 
show that a’ is too.) 


Prove that if a and ( are ordinals, then aN @ = a oraNG = £. (Hint: Let 
C =a and suppose that C # a and C ¥ (3. Let y be the least element 
of a such that 7 ¢ C. Show that y = C, so C € a. Similarly, C € G, so 
C Ean =C, contradicting Theorem 3.13.) 


Prove that if X is a set of ordinals, then U_X is an ordinal. (Hint: To show 
that UX is transitive, note that if « € y € UX, then for some ordinal z © X 
we have y € z. The fact that <c well-orders each element of X helps 
in proving antisymmetry and transitivity. Exercise 14 is useful in showing 
that trichotomy holds. Use the axiom of regularity to show that UX has no 
infinite descending sequences.) 


Prove that if a and @ are ordinals, then a € 3,a = G,or Gea. 


Details from the proof of Theorem 3.27. 


Let f : @, — X andg: G2 — X be 7-functions as defined in the proof 
of Theorem 3.27. Prove that if G € (3; M (a, then f(3) = g(@). (If f 
and g disagree, then there is a least 3 such that f(@) 4 g(@). For this @, 
fC] = g[G]. Apply the definition of a y-function.) 

Zorn’s Lemma and AC. 


Prove in ZF that the following are equivalent: 


1. AC. 


2. Zorn’s Lemma: Let P be a partial ordering (transitive and antisym- 
metric) such that every chain (linearly ordered subset) has an upper 
bound in P. Then P contains a maximal element (an element with no 
elements above it.) 


3. Every set can be well-ordered. 


(a 


wm 


Prove that | implies 2. (Hint: Emulate the proof of Theorem 3.27. 
Suppose P has no maximal elements. For each chain C, let xc be an 
upper bound of C’ that is not an element of C’. Call C' a y-chain if 
for every p € C, Xtyec | y<p} = Pp. Use the union of the 7-chains to 
derive a contradiction.) 


Prove that 2 implies 3. (Hint: Fix a set X to well-order. Let P be 
the set of all one-to-one maps from ordinals to subsets of X. P is 
partially ordered by function extension. Show that every chain has an 
upper bound. Show that a maximal element maps an ordinal one-to- 
one onto X.) 


(b 


wm 


(c) Prove that 3 implies 1. 
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19. Describe a way to place No pigeons in No pigeonholes so that each pigeon- 
hole contains at most one pigeon and No of the pigeonholes are empty. 
(Hint: |Z| = |N| = No.) 


20. Describe a way to place No pigeons in No pigeonholes so that each pigeon- 
hole contains No pigeons. (Hint: |No x No| = No.) 


21. Describe a way to place No pigeons in No pigeonholes so that for each 
Kk < No exactly one pigeonhole contains exactly « pigeons. 


22. Describe a way to place No pigeons in No pigeonholes so that for each 
kK < No exactly No pigeonholes contain exactly « pigeons. 


23. Show that if we put « pigeons into \ pigeonholes and « < 4, then X pi- 
geonholes will remain empty. (Hint: The set of all pigeonholes is the union 
of the empty holes and the occupied holes.) 


3.6 Incompleteness and Cardinals 


... we never assumed that (ZFC) included all the “true” facts. 
— Levy [187] 


In the last section we noted the absence of an example of an uncountable regular 
limit cardinal. We do not have an example because in ZFC we cannot prove that 
uncountable regular limit cardinals exist, assuming that ZFC is consistent. We 
have to assume that ZFC is consistent, because ZFC cannot prove that either. 
The last sentence is essentially Gédel’s Second Incompleteness Theorem and the 
starting point for our exploration of large cardinals. 


3.6.1 Gdodel’s Theorems for PA and ZFC 


Gédel’s 1931 (paper) was undoubtably the most exciting and the 
most cited article in mathematical logic and foundations to appear 
in the first eighty years of the (twentieth) century. 

— Kleene [126] 


Gédel’s First Incompleteness Theorem [124] is a statement about the provability 
of a formula in formal Peano Arithmetic (PA). In a nutshell, Gédel’s first theorem 
says that if PA is w-consistent, then there is a formula G such that PA does not 
prove G and PA does not prove 4G. In order to make this clear, we should discuss 
PA, w-consistency, and the formula G. 

The axioms of PA are an attempt to describe the important properties of the 
natural numbers under the operations of successor (adding one), addition, and 
multiplication. PA includes predicate calculus and axioms that say that 


e (is not the successor of any element, 
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ex+0=axandx2+(n+1)=(e#+n) +1, 
e «:-0=Oanda-(n+1)=a2-n+a, and 
e the distributive laws hold. 


PA also includes an induction scheme that can be used to prove a wealth of facts 
about the natural numbers. At one point, it was thought that PA might be able to 
prove every true statement about N, but then Gédel’s work ruled that possibility 
out. 

We say that a theory is consistent if there is no formula A such that the theory 
proves both A and —A. A theory that is inconsistent and includes predicate calcu- 
lus can prove every formula. Thus a theory is consistent if and only if there is some 
formula the theory cannot prove. When we say PA is w-consistent, we mean PA 
cannot prove both 47 A(x) and every formula in the list =A(0), —A(1), —A(2), 
and so on. Assuming w-consistency is very reasonable, but a little stronger than 
assuming regular old consistency. Rosser [242] devised a way to prove Gédel’s 
first theorem assuming only the consistency of PA. His replacement for Gédel’s 
sentence G is a slightly more complicated formula. 

Informally, Gédel’s formula G says “there is no proof in PA of the formula G.” 
This is encoded in the language of arithmetic. Given our daily exposure to word 
processors and automated spelling and grammar checkers, we are used to the idea 
that formulas and lists of formulas (like proofs) can be represented as strings of 
zeros and ones, and that such strings can be viewed as integers and described by 
arithmetical formulas. It is very remarkable that Gédel devised and utilized an 
encoding scheme in 1931, long before the advent of electronic computers. 

The method for making G refer to G is very entertaining. Let Go(a) be the 
formula that says “there is no number that encodes a proof in PA of the formula 
obtained by substituting the number « for the free variable in the formula encoded 
by the number «.” Suppose that n is the number that encodes Go(x). Note that the 
formula obtained by substituting n for the free variable in the formula encoded 
by n is exactly Go(n). Informally, Go(n) says “there is no number that encodes 
a proof in PA of Go(n).” Thus, Go(n) is the desired formula G. 

Once we have the encoding procedures in hand and have created the formula G, 
the remainder of the proof of Gédel’s First Incompleteness Theorem is straight- 
forward. Suppose that PA is w-consistent. First, suppose that PA proves G. Then 
this proof is encoded by a number n and PA proves that “there is a number that 
encodes a proof in PA of G.” Thus, PA proves 7G, contradicting the consistency 
of PA. Now suppose that PA proves —=G. Then PA proves that “there is a number 
that encodes a proof of G.” By the w-consistency of PA, we can find some num- 
ber that actually does encode such a proof, and so PA proves G. Again, we have 
contradicted the consistency of PA. 

Gédel’s Second Incompleteness Theorem [124] says that if PA is consistent, 
then there is no proof in PA that PA is consistent. Much of the machinery used for 
the first theorem applies here also. The formula that asserts that PA is consistent, 
Conpa, 1s an encoding of the sentence “there are no numbers « and y such that « 
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encodes a proof in PA of a formula and y encodes a proof in PA of the negation of 
that formula.” It is possible to prove in PA that Conp, — G. Thus, if PA proved 
Conpa, then PA would prove G, and that contradicts Gédel’s First Incompleteness 
Theorem. 

Perhaps the most remarkable quality of the incompleteness theorems is the 
ubiquity of their applicability. The theorems utilize only a few important features 
of PA and therefore apply to a wide variety of formal theories. In particular, the 
proofs of the theorems rely heavily on the ability to carry out a modest amount of 
arithmetic and the ability to check proofs in a mechanical fashion. Consequently, 
if a theory has enough axioms (to prove facts about encoding) but not too many 
axioms (so proof checking is not incomprehensibly complicated), then both in- 
completeness theorems apply. For example, both incompleteness theorems hold 
for ZFC. Thus, assuming that ZFC is consistent, there is a formula Z such that 
ZFC proves neither Z nor —=Z, and ZFC does not prove Conzgc. The incomplete- 
ness theorems also hold for ZF and for any extensions of ZF by a finite number 
of axioms. 


3.6.2 Inaccessible Cardinals 


Better to reign in L, than serve in Heav’n. 
— Milton, Paradise Lost (slightly misquoted) 


If & is an uncountable regular limit cardinal, then we say & is weakly inaccessible. 
Our goal is to prove that the existence of weakly inaccessible cardinals is not 
provable in ZFC. At the end of this section we will link this back to our study of 
pigeonhole principles. The plan for achieving the goal is straightforward. The first 
step is to prove in ZFC that if there is a weakly inaccessible cardinal, then ZFC is 
consistent. Then we apply Gédel’s Second Incompleteness Theorem and get the 
desired result. The first step requires a journey to L, the constructible universe. 

We will build the constructible universe in stages. Let Lo = ). If Lg is defined, 
let La+1 be the set of all subsets of LZ, that are definable by restricted formulas 
with parameters from L,,. To be precise, a set X will be placed in La+) if all of 
the following conditions hold. 


eXClau, 
© U1,U2,...,Un € La is a finite list of parameters, 
e wis a formula in the language of set theory, 


e w does not contain the power set or union symbols, 


e each quantifier in ~ is of the form dz € Ly or Vx € La, and 


X= {y € La | wy, U1, U2, fbad Un) }. 
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If q@ is a limit ordinal, let La = UgcaLlg. This suffices to define L, for each 
ordinal number a. Note that for each a, Lg is a set. The constructible universe, L, 
is the class defined by L = UL, where the union ranges over all ordinal numbers. 
Neither {Z, | a is an ordinal number} nor L itself are sets, but it is convenient to 
refer to them using set-theoretic notation. 

The finite levels of the constructible universe are simple in structure. For each 
k <w, each Ly is finite and Ly41 = P(Lx). By definition Lo = 9, and so Ly = 
P(O) = {0} and Lo = P({O}) = {0,{O}}. Rewriting with ordinal notation, 
Lo = 0, £1 = 1, and Lz = 2. At the next level, D3 breaks this pattern, since 
L3 = P(L2) = {0,1, {1}, 2}, which is not an ordinal. L,, is defined by a union, 
so every element of L,, is an element of L;, for some k < w. Beyond L,,, the sets 
become vastly more complicated very rapidly. If « is weakly inaccessible, the L,, 
is an abridged version of the entire universe. This is stated more formally in the 
following theorem. 


Theorem 3.34. [f « is weakly inaccessible, then L,, is a model of ZFC. That is, if 
we restrict the quantifiers in the axioms to L,,, then the axioms of ZFC all hold. 


Comments on the proof. For a detailed treatment, see [80] or [73]. We provide 
only a hint of some of the main issues in the proof. The basic idea is to verify 
that the axioms of ZFC hold in V,, in much the same way that one would show 
that the axioms defining vector spaces hold in R°. Some axioms are very easy to 
manage. For example, @ € L1, so the empty set axiom holds. If a and 6 are in La, 
then {a, b} is in Ly+1, so some instances of the pairing axiom are easy to verify. 
Verification of the infinity axiom relies on « being larger than No, since every set 
in Lx, is finite. The verification of the power set axiom is particularly tricky and 
relies on the assumption that « is a regular limit cardinal. O 


Now we can finish our proof of the unprovability of the existence of a weakly 
inaccessible cardinal. The proof relies on the fact that any set of axioms with a 
model must be consistent. If 1/ is any model for a set of axioms T’, then every 
theorem that can be proved from T is true in /. (This is actually what makes 
proofs useful. If you prove a theorem from the axioms for vector spaces, then it 
has to be true in every vector space.) If T’ is inconsistent, then it proves a contra- 
diction that would have to be true in M. But models are concrete (think of IR? as 
a model for the vector space axioms), so no contradiction can be true in a model. 
For a more technical discussion of models, truth, and provability, see [201]. 


Theorem 3.35. If ZFC is consistent, then ZFC does not prove the existence of a 
weakly inaccessible cardinal. 


Proof. Assume that ZFC is consistent. Suppose, by way of contradiction, that 
ZFC proves that there is a cardinal « that is weakly inaccessible. Then ZFC proves 
that the set L,, exists. By Theorem 3.34 we know that L,, is a model of ZFC, 
so ZFC is consistent. Thus ZFC proves the consistency of ZFC, contradicting 
Gédel’s Second Incompleteness Theorem and completing the proof. O 
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The unprovability of the existence of weakly inaccessible cardinals is not like 
the unprovability of AC in ZF. If we write J for the statement “there is a weakly 
inaccessible cardinal” and write ZFC | I for “ZFC proves J,” then the preceding 
theorem says that ZFC | I, provided that ZFC is consistent. To show that I 
is independent of ZFC (like AC is for ZF) we would also need to prove ZFC 
¥ aI assuming that ZFC is consistent. Thanks to Godel, we know that this is an 
unattainable goal. To see this, suppose (for an eventual contradiction) that from 
Conzgc we can prove in ZFC that ZFC  7/. If ZFC Vf -I, then Conzpc+r. 
Thus, our hypothesis boils down to ZFC F Conzpc — Conzpc+r. Since ZFC+I 
is an extension of ZFC, we have ZFC+I - Conzpce — Conzpc+r1. Theorem 
3.34 shows that ZFC+J + Conzpc. Concatenating the last two lines, we obtain 
ZFC+I + Conzgc+r7, contradicting Gédel’s Second Incompleteness Theorem for 
ZFC+I. 

Finally, we should summarize the combinatorial implications of this section. 
We know that if ZFC is consistent, then it cannot prove the existence of a weakly 
inaccessible cardinal. Also, & is weakly inaccessible if and only if it is an uncount- 
able regular limit cardinal. By the Ultimate Pigeonhole Principle, « is regular if 
and only if whenever « pigeons are placed in fewer than « pigeonholes, then some 
hole contains & pigeons. So a weakly inaccessible cardinal is an uncountable limit 
cardinal with this pigeonhole property. In Section 3.5 we proved that No and Xa+1 
for each ordinal a have this pigeonhole property. In this section we have proved 
that we cannot prove the existence of any more cardinals with this property. 


3.6.3 A Small Collage of Large Cardinals 


All for one ... and more for me. 
— Cardinal Richelieu in The Three Musketeers 


A cardinal number is said to be large if there is no proof in ZFC of its existence. 
We just met our first large cardinal, the weakly inaccessible cardinal. There are 
many other large cardinals related to combinatorial principles. This section lists 
the ones we need for the next two sections. 

Recall that a weakly inaccessible cardinal is an uncountable regular limit car- 
dinal. We say that a cardinal & is a strong limit if for every 1 < Kk we have 
|P(A)| < «. An uncountable regular strong limit cardinal is called strongly in- 
accessible (or just inaccessible). Every strongly inaccessible cardinal is weakly 
inaccessible. If we assume the generalized continuum hypothesis (GCH), then 
every weakly inaccessible cardinal is also strongly inaccessible. (See Exercise 
6.) The hypothesis GCH is independent of ZFC, and has inspired a great deal of 
interesting work [173]. 

We say that « is weakly compact if « is uncountable and « — («)3. This arrow 
notation is the same used in Section 3.2, so « — («)3 means that if we color the 
edges of a complete graph with « vertices using two colors, then it must contain 
a monochromatic complete subgraph with « vertices. These cardinals reappear in 
the next section. 
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We will also look at some results concerning subtle cardinals, another type of 
large cardinal defined in terms of colorings of unordered n-tuples. Suppose that « 
is a cardinal and let [«]” denote the set of n-element subsets of «. We say that a 
function S': [K]” — P(K) is an (n, «)-sequence if for each element of [«]” of the 
forma, < ag <-++: < Qn < K, we have S({a1,@2,...,Qn}) C ay. A subset 
C C «is closed if the limit of each sequence of elements in C’ is either « or in 
C’.. The subset C' C & is unbounded if for each a © &, there is a 3 € C such 
that a < 3. We abbreviate closed and unbounded by writing club. The cardinal « 
is n-subtle if for every (n, &)-sequence S and every club set C C x, there exist 
elements 31, 32,..., 8n41 © C such that 


S81, B2,---, Bn) = B10 S(Ba, 83, --+5 Bn41)- 


The basic idea is that given a coloring S and a large set C’, the large set must 
contain some elements that are monochromatic. The n-subtle cardinals are closely 
related to n-ineffable cardinals. More information on both these types of cardinals 
can be found in [18], [19], [80], and especially [157]. 

One variation on coloring n-tuples for fixed values of n is to color n-tuples for 
alln € w simultaneously. Let [x]<“ denote the set of all finite subsets of «. We say 
k& is a Ramsey cardinal and write k — («K)5“ if for every function f : [«]<“ 
there is a set X of size « such that for each n, f is constant on [X]”. Note that 
the same X works for all n, though when j # k the j-tuples may not be the same 
color as the k-tuples. 

One type of cardinal often mentioned in the literature is bigger than anything 
we have listed so far. We say that « is a measurable cardinal if there is a K-additive 
two-valued measure on k. Roughly, this means that there is a way of assigning a 
value ju(X) to each X C x« so that yp acts a lot like the measures that appear in 
analysis. 

One way to organize all these cardinals is by comparing the sizes of the least 
example of each type of cardinal. Suppose we assign letters as follows: 


2 


W: Least weakly inaccessible cardinal, 
I: Least strongly inaccessible cardinal, 
C: Least weakly compact cardinal, 

S1: Least 1-subtle cardinal, 

S2: Least 2-subtle cardinal, 


Sn: Least n-subtle cardinal, 


R: Least Ramsey cardinal, 
M: Least measurable cardinal. 


Then we have the following relationships: 


W<I<C<S1l<S2<-:--<Sn<---<R<M. 
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The proofs of these relationships are frequently nontrivial. Good references in- 
clude [80] and [168]. 
Exercises 

1. For each k € w show that |Ly41| = 2!4+!. 

2. Using Exercise 1, prove that L,, is countable. 


3. Prove that for each ordinal a, La is transitive. (Hint: L, 1s transitive if 
x €y € La implies x € L.. Use induction on the ordinals.) 


4. Prove that if a < 3, then Ly C Lg. 


5. Prove that if « is a limit cardinal and x € Lg for some @ < k, then we also 
have Ux € L,,. 


6. The generalized continuum hypothesis (GCH) asserts that for every ordinal 
a, |P(Na)| = Na+i. Assuming GCH, prove that every limit cardinal is 
a strong limit cardinal. As a corollary, show that GCH implies that every 
weakly inaccessible cardinal is strongly inaccessible. 


7. Construct a 2-coloring f of [w]<“ such that f is constant on [w]” for each 
n, but no pair has the same color as any triple. 


3.7 Weakly Compact Cardinals 


Watch out for that tree! 
— George of the Jungle theme song 


Theorem 3.3 says that if we 2-color a complete graph G with No vertices, then 
it must contain a monochromatic subgraph with No vertices. In arrow notation, 
this is written as No — (No)3, where the first No is the size of G and the second 
No is the size of the desired monochromatic subgraph. It would be nice to know 
what other cardinals & satisfy « — («)2. We call an uncountable cardinal with 
this property weakly compact. Judging from the next theorem, our list of weakly 
compact cardinals may be very short. 


Theorem 3.36. |R| 4 (%1)3 and consequently, %, 7 (1)3- 


Proof. Let « = |R| and let g : « — R be a matching between the ordinals less 
than « and the reals. Let G be a complete graph with « vertices. We can think of 
each vertex of G as having two labels, an ordinal a < « anda real number g(q). 
Color the edges of G using the scheme 


red ifa< Bo g(a) < g(f), 


xa) = fe ifa < B= g(B) < g(a). 
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Informally, y colors the edge af red if the order on the ordinal labels agrees with 
the order on the real number labels, and colors the edge blue if the orders disagree. 

Suppose that S is a subgraph of G and |S| = &1. We will show that S is not 
monochromatic. Since the ordinal labels for the vertices of S are a well-ordered 
subset of K, we can list them in increasing order as (a | y < 81). We consider 
two cases. 

First, suppose that S is red. Then the ordering on the real labels of the vertices 
of S agrees with the ordering on the ordinal labels. This gives us an uncountable 
well-ordered increasing sequence of reals, (g(a) | y < Xi). Using the fact that 
Q is dense in R, for each y € Ni, choose a rational q, such that g(a,) < qy < 
g(ay41). Then (q, | y < 1) is an uncountable sequence of distinct rationals, 
contradicting the countability of Q. (See Exercise 5 in Section 3.5.) Thus, S is not 
red. 

Second, suppose that S'is blue. This yields an uncountable decreasing sequence 
of reals. By choosing q, such that g(ay) > gy > g(ay+41) we obtain another 
contradiction in the same fashion as in the preceding case. 

Summarizing, S is neither red nor blue. Thus G contains no monochromatic 
subgraph of size X1. 

To prove the last statement in the theorem, we note that by Theorem 3.18 and 
Corollary 3.23, N < R. Thus No < |RJ, and so |R| > 4. Since there is a way 
to color a graph with |R| vertices so that no &j-sized monochromatic subgraphs 
exist, we can certainly do the same for a (possibly smaller) graph with a mere Ny 
vertices. O 


Summarizing, we know that No — (No)3, but Ni 4 (N1)3. We can general- 
ize Theorem 3.36 to show that Xa+1 7% (Na+1)3, eliminating all the successor 
cardinals from our hunt for weakly compact cardinals. 


Theorem 3.37. |P(Xa)| 4 (Na+1)3- 


Proof. Imitate the preceding proof using P(,,) in the role of R. To do this, prove 
and use the fact that when P(X,,) is ordered by the relation 


X <Y ifandonlyif min((X —Y)U(Y —X))eY, 
it contains no increasing or decreasing sequences of size Xq41. O 


At this point we know that any weakly compact cardinal must be an uncount- 
able limit cardinal. In the following theorem, we emulate Erddés and Tarski [97] 
in showing that any weakly compact cardinal is inaccessible, and therefore large. 
Their studies of weakly compact cardinals were motivated by questions in infinite 
combinatorics stated in their 1943 paper [96]. 


Theorem 3.38. [f « is weakly compact, then k is strongly inaccessible. 


Proof. Suppose that & — (K)3. We need to show that « is regular and a strong 
limit cardinal. 
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First suppose that « is not regular, so for some A there is a function f : A > k 
such that Ua<) f(a) = «. We may assume that f is increasing. We will use f to 
construct a 2-coloring of a complete graph with « vertices. For each a < 3 < k, 
color the edge a3 using the scheme 


red if do(a < f(y) < 8), 
blue otherwise. 


x(aZ) = 
Informally, f chops « into A intervals. An edge is red if it connects two intervals. 
Thus, no red subgraph can be larger than size \. Also, each interval chopped out 
by f is smaller than «, so there is no blue subgraph of size «. This contradicts 
k — (K)3, proving that no function like f exists and that « is regular. 

Now suppose that 8. < «. If « < |P(Nq)|, then by Theorem 3.37, h A 
(No+1)3- But Nai < &, So this implies that h A («)3, contradicting weak 
compactness. Thus, if Ny < «, then |P(Nq)| < «, proving that « is a strong limit 
cardinal. O 


We have seen that if « is uncountable and « — («)3, then « is a large cardinal. 
By the results in Section 3.6, we know that ZFC cannot prove that such cardinals 
exist. Interestingly enough, increasing the number of colors (to any value less that 
&) or increasing the size of the n-tuples (from 2 to any n < w) does not lead 
to larger cardinals. That is, if \ < «,n © w, and « is weakly compact, then 
kK — (k)%. (Details are left as exercises.) We could color n-tuples for all n € w, 
and look for a single set that is monochromatic for n-tuples for each n. This leads 
to the Ramsey cardinals mentioned in Section 3.6, which are considerably larger 
than the weakly compact cardinals. 

In Section 3.1 we saw a close relationship between Ramsey’s Theorem and 
K6nig’s Lemma. This relationship reappears at higher cardinalities. We say that 
a cardinal « has the tree property if whenever T is a tree with & many nodes 
and every level of T’ has size less than «, then 7’ must have a path of size k. 
KG6nig’s Lemma (Theorem 3.2) says that No has the tree property. The next theo- 
rem shows the connection between weakly compact cardinals and cardinals with 
the tree property. 


Theorem 3.39. «& is weakly compact if and only if & is strongly inaccessible and 
has the tree property. 


Pointers: Excellent proofs of this result can be found in the books of Drake (The- 
orems 3.5 and 3.7 in Chapter 7 of [80]), Jech (Lemma 29.6 in Chapter 5 of [168]), 
and Roitman (Theorem 36 in Chapter 7 of [241]). O 


Theorem 3.39 does not characterize the smaller cardinals with the tree property. 
As we have already noted, No has the tree property. However, ZFC proves that X; 
does not. There is a tree T’ such that |7'] = Nj, the cardinality of each level of 
T is less than 8, and T has no paths of size X81. Such a tree is called an Xj- 
Aronszajn tree, and stands as a counterexample to XN; having the tree property. 
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The existence of an N2-Aronszajn tree is deducible from GCH in ZFC. On the 
other hand, Silver has shown that from the existence of a weakly compact cardinal 
we can prove the consistency of ZFC and “Ng has the tree property.” Mitchell and 
Silver have a number of other results pertaining to the tree property. Finally, using 
the ordered list of cardinals that appears in Section 3.6, if « is the least strongly 
inaccessible cardinal, then & is strictly less than the least weakly compact cardinal, 
so by Theorem 3.39, « cannot have the tree property. Thus if « is the least strongly 
inaccessible cardinal, then there is a «-Aronszajn tree. 


Exercises 


1. Prove that if « > («)3, then k — («)? for every \ < kK. 


2. Prove that if « — (K)} for each A < x, then K — (K)3. 


3. Assuming GCH, show that if « is the least weakly inaccessible cardinal, 
then there is a k-Aronszajn tree. 


4. Find a proof that there is an 8-Aronszajn tree. (Hint: A library is a good 
place to look for proofs.) 


3.8 Infinite Marriage Problems 


Infinite matching theory may seem rather mature and complete as it 
stands, but there are still fascinating unsolved problems... 
— Reinhard Diestel [75] 


We have considered the problem of matchmaking in the guise of graph matchings 
in Section 1.7, systems of distinct representatives in Section 1.7.2 and Section 3.4, 
and stable marriages in Section 2.9. Now we will study formulations of infinite 
marriage problems, expressed in some anthropocentric terminology. 

Suppose M is a set of men. For each man m € M, let W(m) denote the 
women on his list of potential wives. For a set S C M, we will write W(S) = 
UmesW(m) for the combined lists of all the men in S'. We will call the ordered 
pair (7, W) a society. A society is espousable if there is a one-to-one function 
f : M — W(M) such that for every m € M we have f(m) € W(m). We 
are requiring that f is an injection, so polygamy is disallowed. Every man must 
marry someone on his list, but some women may be left unmarried. Implicitly we 
assume that the collections of men and women are disjoint, so MM W(M) = 0. 

Some readers may argue that this terminology is quaint or sexist. We use it 
in deference to earlier authors (e.g. [2]) and because it makes the concepts very 
concrete and clear. Technically any matching application could be substituted, 
such as callers and circuits, readers and books, or pigeons and single occupancy 
pigeon holes. 

Our goal is to determine exactly which societies are espousable. We can address 
certain situations by applying the theorems of various Halls. 
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3.8.1 Hall and Hall 


She made it perfectly plain that she was his... 
— Hall and Oates 


If we have some group of seven men whose combined lists contain only five 
women, we have no hope of finding a wife for every man. In general, given any 
society (IZ, W) with a subpopulation S C M such that |S] > |W(S)|, we know 
that the society is not espousable. When / is finite, this condition is the only 
possible barrier to solving a marriage problem, as shown by the following theorem 
of Philip Hall [147]. (An alternate form of this theorem was published earlier by 
D. K6nig [180].) 


Theorem 3.40. Jf (14,W) is a society and |M| < Xo, then the following are 
equivalent: 


1. Forevery S Cc M, 


S| <|W(S)|. 
2. (M,W) is espousable. 


Proof. Given a society (MM, W), construct the graph G consisting of a vertex for 
each person and an edge from each man to each women on his list. Thus the vertex 
set is MM UW(M) and the set of edges is {(m,W(m)) | m € M}. This graph 
is bipartite; separate the vertices by gender. If the cardinality condition in item | 
holds, then by Theorem 1.51 there is a matching of the men into the women, so 
(M, W) is espousable. To prove the converse, suppose that (1/7, W) is espousable 
and that f is the injection of M7 into W(M/) matching each man to his wife. Then 
for each S Cc M, f restricted to S is an injection of S' into W(S), so by the 
definition of cardinality we must have |S] < |W(S)|. O 


The preceding theorem is usually viewed as a result about finite societies, but 
the only requirement is that the population of men is finite. The number of women 
could be anything (e.g. a woman for every real number) and the theorem still 
holds. Thus, as long as the number of men is finite, this theorem completely settles 
the question of which societies are espousable. If we allow an infinite number of 
men, problems may arise. By restricting ourselves to the case where each man has 
a finite list of women, we can use a theorem of Marshall Hall, Jr. [145] to settle 
the problem. 


Theorem 3.41. Suppose (M,W) is a society and for every m € M we have 
|W(m)| < Xo. Then the following are equivalent: 


1. Forevery S C M, 


S| <|W(S)|. 
2. (M,W) is espousable. 


Proof. First, we will prove the theorem for countable sets of men using a result 
from a previous section. Given a society (IV, W) with |/7| < No, consider the 
family of sets defined by the formula T = {W(m) | m € M}. Since |M] < No, 
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we could index the members of T with natural numbers. Also, since each W (m) 
is finite, we have that 7’ is a countable family of finite sets. By Theorem 3.16, T’ 
has a system of distinct representatives (SDR) if and only if the condition in item 
1 holds. From the definition of an SDR, T has an SDR if and only if (17, W) is 
espousable. Thus item | holds if and only if item 2 holds. 

Now we will prove the theorem for arbitrarily large sets of men. Our proof that 
item | implies item 2 will use the following compactness principle: “If T is a 
set of formulas and every finite subset of T’ has a model, then T’ has a model.” 
Just as in Section 3.6.2, if U is a model of 7’, then every object mentioned in 
formulas of T' will appear in U, and every formula of T' will be true of the objects 
in U. Suppose that (17, W) is a society satisfying item | in the theorem. For each 
m € M,if W(m) = {w1,...,wWn} is the set of women on m’s list, add the 
formula f(m) = w, V--: V f(m) = wp, to T. For every pair m,,m2 € M 
with m1 # mo, add the formula f(m,) 4 f(mz2) to T. By the condition in item 
1, every finite subset of 7’ has a model. By the compactness principle, JT has a 
model. Let f be the function in this model. Then f is a matching of the men into 
the women, as desired. The proof that item 2 implies item | is identical to that for 
Theorem 3.40. O 


From the proof, it appears that Theorem 3.41 for countable M/ is just Theorem 
3.16 with some of the terminology changed. Indeed, Theorem 3.41 for countable 
M could be added to the list of results in Theorem 3.17. Thus, in ZF we can 
prove that this marriage theorem for countable MM is equivalent to K6nig’s Lemma 
and also to the countable axiom of choice for finite sets (CACF). Even in axiom 
systems much weaker than ZF, a countable version of this marriage theorem can 
be shown to be equivalent to a weak version of K6nig’s Lemma [162]. 

Hall’s [145] original proof of Theorem 3.41 for arbitrarily large sets of men 
uses Zorn’s Lemma, a statement equivalent to the full axiom of choice (AC). 
Our proof uses the compactness principle from logic, which is equivalent to the 
Prime Ideal Theorem (PIT). Since PIT is weaker than AC (see [167]), we can also 
conclude that working in ZF it is not possible to deduce AC from Theorem 3.41. 
It is not known whether or not PIT can be deduced from Theorem 3.41 in ZF. 

It may seem odd that we restrict the length of the lists of potential wives in 
Theorem 3.41. If some man has an infinite list, then he must not be very picky. 
On the surface, it seems like it should be easy to find him a wife. However, our 
intuition here is based on finite societies, and infinite societies (like infinite sets) 
are very peculiar. 

Consider the following situation. Let M = {mo,mj,...} be the men in our 
society and let Y = {wo,wi,...} be the women. Let W(mo) = Y and for 
j > 0, let W(m;) = {w;-1}. If S C M, then the structure of W(S) falls 
into two nice cases. If mo € S, then W(S) D W(mo) D Y, so W(S) = Y. 
If mo ¢ S, then W(S) = {w, | mj4i € S}. In either case, |S| < |W(S)|. 
Thus, the society (17, W) satisfies item 1 in Theorem 3.41 and satisfies all the 
hypotheses of Theorem 3.41 except that |W(mo)| = No. However, this society 
is not espousable. By its construction, mm, must marry wo, m2 must marry wz, 
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and so on. If mo marries some w;, then m,;+1 will be deprived of a wife. Thus, 
because ™m has an infinite list, item 1 of Theorem 3.41 is no longer sufficient to 
guarantee that the society is espousable. 


Exercises 


1. Prove that if (17,W) is any espousable society then for every S Cc M, 
|S| < |W(S)|. (Hint: Since the society may have infinitely men or a man 
with an infinite list, neither Theorem 3.40 nor Theorem 3.41 applies di- 
rectly. However, you could adapt part of a proof.) 


2. Without using Theorem 3.17, prove in ZF that Theorem 3.41 for countable 
M implies CACF. 


3. Without using Theorem 3.17, prove in ZF that Theorem 3.41 for countable 
M implies Konig’s Lemma. 


4. Prove in ZF that Theorem 3.41 implies the axiom of choice for finite sets 
(ACF). 


5. Consider the society containing men M = {mo,mj,...} and women 
Y = {wo,wi,...}, where W(mo) = Y and W(m,;) = w,; for 7 > 0. 
Is (M, W) espousable? 


6. Construct a society such that all the following hold: 
(a) W(mo) is infinite. 
(b) There are infinitely many women who are not on 79’s list. 
(c) For every finite S c M, |S < |W(S)|. 
(d) (IM, W) is not espousable. 


7. Suppose (1/7, W) is a society in which exactly one man has an infinite list 
and for all finite sets S C M,|S| < |W(S)|. (Note the strict inequality.) Is 
(M, W) espousable? 


3.8.2 Countably Many Men 


The squad may count off in a line or column formation. 
— Drill sergeant study guide 


In this subsection, we will examine four marriage theorems for the situation where 
the population of men is countable. We begin with the theorem of Damerell and 
Milner [66]. The statement of this theorem incorporates a margin function /u,,,, 
inspired by a conjecture of Nash-Williams [211]. Although the exact definition of 
[4y, is somewhat technical, the underlying notion is easy. Suppose (17, W) is a 
society and let Y C W(M/) be some subset of the women. Build the set of all 
the men whose entire list lies within Y, soS = {me M|W(M) CY}. If Y 
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and S are finite, then |Y| — |.S| indicates how many extra women Y contains. If 
|Y | — |S| is ever negative, then the society is not espousable. The margin function 
[4, generalizes this notion of measuring the extra women in a set. We will define 
it precisely after we see how it is used in the following theorem of Damerell and 
Milner [66]. 


Theorem 3.42. If (M,W) is a society and |M| < No then the following are 
equivalent: 


1. For every Y C W(M) we have ju.,(Y) > 0. 
2. (M,W) is espousable. 


In a moment, we will define /1.,, and make some comments on the proof of the 
theorem. Before we do that, it is worth taking a minute to note how this theorem 
compares with those in the preceding and following subsections. First, in this 
theorem, each man may have any number of women on his list. This is a change 
from M. Hall’s Theorem 3.41 where each man was restricted to a finite number 
of women. On the other hand, Theorem 3.41 places no restriction on | /| and this 
theorem requires the set of men to be countable. This is not an idle restriction. In 
[66], Damerell and Milner provide an example of a society with an uncountable 
collection of men that satisfies item | of the theorem, but fails to be espousable. 
This example and a theorem for uncountable collections of men is given in Section 
3.8.3. 

To clarify the statement of the theorem we must define ju,,,. To do this we need 
some ancillary definitions. Let (17,W) be a society with || < No. For a set 
Y Cc W(M), a tower on Y is defined to be an infinite nested sequence of sets 
T = (T, | n < w) such that To C T; C To C... and Y = Uncy Ty. Given a 
tower T on Y, let d(T’) be the number of men whose lists are subsets of Y but not 
subsets of any element in the tower. That is, 


d(T) =|{m € M | W(m) CY AVnW(m) £ Ta} I: 


Let Z* denote the extended integers, including the usual integers plus symbols for 
infinities, so Zt = {...,—2,—1,0,1,2,...} U {—co, co}. Now we can define 
/tq for each ordinal a. In the case where a = 0, for Y C W(M) define 
lY|—|{meM|W(m) cCY}| if |¥| < No 
Lo(Y) = : 

The definition of the margin function for successor ordinals uses a special class 
of towers. Suppose that ju. has been defined and that it maps subsets of W(M) 
into Z*+. For each Y C W(M) let A,(Y) denote the collection of all towers 
T = (T, | n € w) on Y satisfying both wa(To) < oo and for alln < w, 


He(Tn) = Ha(To). Thus, Ag(Y) consists of all towers on Y such that pq is 
constant and finite on the elements of the tower. 
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With the formulation of A, (Y) in hand, we can complete the definition of the 
margin function. Suppose a = 3 + 1 and jug and Ag(Y) have been defined. Then 
for each Y C W(M) define pq by 


_ Jinfreagcy)(ua(To) —d(T)) if A(Y) 4 
Ho(¥) = {2 if Ag(Y) = 


If a is a limit ordinal, and jug is defined for all 3 < a, then define 


Ho(¥) = inf pa(¥). 
In particular, u,,(Y) = infgcu, we(Y) where w; is the smallest uncountable 
ordinal. 

In light of the complexity of the construction of j4,,,, it is not too surprising 
that the proof of Theorem 3.42 is beyond the scope of this book. However, we 
will make a few comments on the general structure of the proof and verify the 
theorem for a couple of societies. Theorem 3.42 states that a society with a count- 
able number of men is espousable if and only if VY C W(M) ju.,(Y) > 0. The 
proof that the society is espousable actually constructs the matching one man at a 
time. The condition on /i,,, insures that the construction can proceed at each stage. 
The countability of // insures that the men can be counted off as mo, m1, 7™2,... 
so that the construction does not involve a limit stage. To prove the converse, the 
authors assume the existence of a matching and show that for each a < w and 
each Y C W(M) the inequality 44(Y) > 0 holds. This argument is carried 
out in a number of lemmas, largely proved by transfinite induction. These induc- 
tion arguments have base cases, successor cases, and limit cases, mirroring the 
construction of /4,,,,. For full details, see [66]. 

As an aside, we should elaborate on this use of constructions and induction 
proofs indexed by ordinals. These proof techniques are referred to as transfinite 
recursion and transfinite induction. Both principles can be proved in ZFC. In the 
case of recursion, ZFC proves that given any formula 0(x, y) and any ordinal a, 
if for every x there is a unique y such that 0(x, y) holds, then there is a function 
f with domain dom(f) = a such that for every 6 < a we have 0(f |g, f(3)). 
(The notation f |g denotes the function that is the same as f for inputs less than 
@ and is undefined elsewhere. We call this the restriction of f to 3.) For a proof 
of this transfinite recursion theorem, see [88]. In terms of the construction of our 
margin function, the transfinite recursion theorem asserts that there is a function 
f with domain w; + 1 such that for each a < wy, f(a) = fq. Thus, not only does 
[uy exist, but a function encoding the entire construction of ju,,, exists. 

The transfinite induction principle states that given an ordinal a and a formula 
O(a), if both 6(0) and VB < a((Vy < 6B O(y)) — 4(G)), then VB < a A(B). 
Just as in regular induction, we can view 0(0) as a base case and the formula 
VG < a((Vy < 8 A(y)) — A(B)) as an induction step. The proof of the induction 
step is often broken into cases where (3 is a successor ordinal and where (3 is a limit 
ordinal. Like the standard induction scheme, transfinite induction can be viewed 
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as a consequence of a least element principle. If 6(/3) fails for some G < a, 
then the set {8 < a | —=6((G)} exists. This set is well ordered, and thus has 
a least element J. If G9 = 0 then the base case fails, while if Go > O then 
the induction step fails. In this fashion, the existence of a least Jo proves the 
contrapositive of the transfinite induction priniciple. This concludes our aside on 
transfinite recursion and induction; we return to our discussion of Theorem 3.42. 

We will now verify Theorem 3.42 for two societies. We begin with the society 
from Subsection 3.8.1 that was not espousable. 


Example (/, W). Let (V7, W) be the society in which 
M = {mo,m1,™Mo2,..-}, 
W(M) = X = {wo, wi, w2,.--}, 
W (mo) = X, and 
W(m,) = {w;-1} for each j > 0. 


As shown in Subsection 3.8.1, the society in Example (/, W) is not espous- 
able. Consequently, we should be able to find a set of women Y C W(M) such 
that fu, (Y) < 0. To do this, we will need to calculate some values of jp and [11, 
using the definitions following the statement of Theorem 3.42. 

If Y = {wj,,..., wi, } is a finite set of women, then the collection of all men 
m such that W(m) C Y is exactly {mj,41,...,™mi,+1}. Because Y is finite 
and W(mpo) is infinite, mo is never included in such a list. Since ji9(Y) is the 
cardinality of Y less the number of these men, jio(Y) = 0. On the other hand, if 
Y is infinite, then jp(Y) = co. Summarizing, 


O if/Y|<® 
ho(Y) = Mle 
oo if |Y| >No. 


Given ji9(Y), we can calculate Ao(Y) for each set of women Y. When Y is finite, 
[Uo is constantly 0 on every subset of Y, so Ao(Y) will consist of all infinite nested 
sequences of subsets of Y that include Y . (Note that elements in the tower can be 
repeated.) If Y is infinite, Ao(Y) consists of all infinite nested sequences of finite 
subsets of Y that eventually contain every element of Y. 

With juo and Ao(Y) in hand, we are ready to calculate a value of ju;. Consider 
the tower T = (T; | i < w) defined by T; = {w, | 7 < i}. We know that 
T € Ao(W(M)). For each j > 0, W(m,) C Tj, so mo is the only man satisfying 
both W(mo) C W(M) and VnW (mo) ¢ T,. From the definition of d(T), we 
have d(T) = 1. Now 


A\\ = : Ny ’ 
p(W(M)) =, int, (Ho(T3) ~ aT") 


< po(To) — d(T) 
=)0— 1 
=-l. 
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Thus 44;(W(M)) < 0. It can be shown that whenever a < {, wsa(Y) < balY). 
(See the exercises.) Consequently, j.,,(W(M)) < ui(W(M)) < 0, and the first 
item of Theorem 3.42 fails, verifying the theorem. 

To distinguish our next example from the work we have just completed, we will 
use boys and girls. 


Example (B,G). Let (B,G) be the society in which 
B = {bo, bi, ba,...}, 
G(B) = {90,91,92,---}, 
G(bo) = {go, gi}, and 
G(b;) = {9;} for each 7 > 0. 


It is easy to see that the map f(b;) = g(Z) is a solution to this marriage problem. 
We should be able to show that ju.,,(Y) > 0 for every possible set of girls Y. 

We begin by finding o(Y). If we choose a finite set of girls Y that contains 
go but omits gi, then {m € M | W(m) Cc Y} will consist of exactly those 
boys whose indices match the indices of the non-go girls of Y. In this case, go is 
“extra,” and io(Y) = 1. For other finite Y, o(Y) will always be 0. Summarizing, 
we have 

0 if |Y|]<NoA (go €YVGEY) 
Ho(Y)=41 if |¥|<NoAgG EY AM EY 
oo $if|Y| >No. 


Any sequence included in Ag(Y) must be an infinite nested sequence of finite 
subsets of Y whose union is Y. Additionally, to insure that for every n we have 
Ho(In) = fo(To), either we must have both go € To and gi ¢ Y, or we must 
have that go € Ty implies gi € Ty for all k. In the first case 4(T;,) = 1 for all 
n, and in the second case u(T;,) = 0 for all n. If W(mo) = {90,91} C Y, then 
W (mo) C T,, for some n. This implies that d(T’) = 0 for all T € Ao(Y). 

Now we are ready to calculate 41. If go € Y and g; ¢ Y, then pio (To) = 1 for 
all T € Ao(Y). Since d(T) = 0, we have p1(Y) = po(To) — d(T) = 1 in this 
case. On the other hand, if go ¢ Y or gi € Y then for every T € Ao(Y), go € Th 
implies g; € Ty. In this case uo(To) = 0, yielding pi (Y) = po(Zo) — d(T) = 
0 — 0 = 0. Summarizing, we have 


O if YVqmEY 
Ma(¥) = es ‘“ 
1 ifg EC YAQnE€Y. 


We have shown that for all Y C G(B), i(Y) > 0. Furthermore, it is not 
too hard to see that for every Y, Ao(Y) = Ai(Y). Since d(T’) is always 0, it 
immediately follows that w2(Y) = si (Y). An easy transfinite induction proves 
that fa(Y) = wi(Y) for all a and Y. In particular, when a = w, we have 
Hu, (Y) = fi (Y) > 0 forall Y, so the first item of Theorem 3.42 holds, verifying 
the theorem for this espousable society. 
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Of course, our two verifications do not constitute a valid proof, nor do they 
even provide a significant body of empirical evidence. However, they do reveal 
the structure of the margin functions, and the interaction between the margin func- 
tions and the matchings (or absence of matchings) in societies. 

After Damerell and Milner, other mathematicians devised necessary and suffi- 
cient conditions for the espousability of societies with countable sets of men. We 
will collect three statements of this sort below in Theorem 3.43. To unify their 
presentation, we will introduce the terminology of strings and admissibility as 
found in Wojciechowski [286]. 

The simplest form of admissibility is due to Podewski and Steffens [223]. The 
marriage problem (VM, W) is c-admissible if there is no subset J C M for which 
there is an element m € M — J such that W(m) C W(.J) and the society (J, W) 
has a unique solution. Note that if sucha J andm € M — J exist, then the society 
(J U{m}, W) is not espousable. Thus a society is c-admissible if it avoids these 
egregiously solutionless subsocieties. 

The other two forms of admissibility use the following notion. A string in 
(M, W) is a one-to-one function from an ordinal into MU W(M). We can have 
strings of men, strings of women, or mixed gender strings. A string f is saturated 
if no man precedes any woman on his list. That is, f is saturated if whenever 
f(8) =m, then W(m) C {f(a) | a < By. 

Saturated strings are used in the definition of y-admissibility introduced by 
Wojciechowski [286]. Define the function jz on strings f as follows. Suppose a is 
dom(f) (the domain of f). If a = 0, then let y.(f) = 0. If a = 6 + 1, then define 


uf) by 
u(f) = uf ta)+1_ if f(@) € W(M) 
wf ta)—1 if f(@) eM. 


If a is a limit, then define p(f) = liminfg_.o u( fg). We say that (WV, W) is 
p-admissible if (f) > 0 for every saturated string of (M,W). Informally, a 
saturated string is like a line of people. The margin function ju(f) indicates how 
many spare women are in the line defined by f. If (/, W) is ju-admissible, then 
the only way a line can have a negative number of spare women is if some man 
jumps in front of a woman on his list. 

Our third form of admissibility is due to Nash-Williams [212]. Suppose f is a 
string of women. Define q(f) as follows. Suppose a = dom(f). If a = 0, then 
define g(f) = —|{m € M | W(m) = 9}}. In the following let ran(f) denote the 
range of f and let f |g denote the restriction of f to G. If a = 6 + 1, then define 


a(f) = a(f Ta) — tm e M|W(m) C ran(f) AW(m) ¢ ran(f [)}I- 
When a is a limit, define 


a(f) = lign inf af ta)— 


l{m€ M|W(m) Cc ran(f) AVB <aW(m) ¢ ran(f [g)}]. 
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We say that (17, W) is g-admissible if q( f) > 0 for every string f of women. The 
construction of q parallels the definition of ju,,,, but substitutes strings for towers. 
This is Nash-Williams’ simplification of the concepts in Damerell and Milner’s 
work on his conjecture. 

With all this terminology, we can state Theorem 3.43, a sort of historical tour 
of marriage theorems for societies with countable populations of men. 


Theorem 3.43. If (1/,W) is a society and |M| < No, then the following are 
equivalent: 


1. (M,W) is espousable. 

2. (M,W) is c-admissible. 
3. (M,W) is q-admissible. 
4. (M,W) is p-admissible. 


Proofs of the equivalences above can be found in the papers of Podewski and 
Steffens [223] (for item 2), Nash-Williams [212] (for item 3), and Wojciechowski 
[286] (for item 4). Verifications of these marriage theorems for the previously 
analyzed examples are left as exercises. 


Exercises 


1. Prove that for all Y, a+i(Y) < wa(Y). 


2. Use transfinite induction to prove that for every pair of ordinals a and (3, if 
a < GB then p3(Y) < pa(Y) for all Y. (Hint: Fix a, let @ = a+ 7 and use 
induction on y. Exercise | is the induction step.) 


3. Prove directly that the society in Example (7, W) is not c-admissible. 
. Prove directly that the society in Example (17, W) is not g-admissible. 
. Prove directly that the society in Example (1/7, W) is not jz-admissible. 
. Prove directly that the society in Example (B, G) is c-admissible. 

( 

( 


. Prove directly that the society in Example (B, G) is q-admissible. 


Oo NN WBN WN Ff 


. Prove directly that the society in Example (B, G) is p-admissible. 


3.8.3 Uncountably Many Men 


If you can count the leaves of the trees, 
Or the foaming waves of the untamed seas, 
Then I will entrust to you alone 
To reckon the amours I have known. 
— Anacreontea, Ode 14 (translated by J. F Davidson [67]) 
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The marriage theorems of the previous section allow each man an arbitrarily long 
list of prospective wives, but require that the set of men is countable. This is not an 
idle restriction. The theorems all fail for certain societies with uncountably many 
men. 

Consider the following example, taken from [66]. We use w, to denote the first 
uncountable ordinal and define the society (17, W) by 


M = {mq |w<a<uy}, 
W(M) = {ya | a < w}, and 
Yma € M W(ma) = {yp | 8 < ah. 


Thus we have a man for every infinite ordinal less than w;, a woman for every 
ordinal (infinite or finite) less than w;, and each man’s list consists of those women 
whose indices are strictly less than his own. Since m,, is the man of lowest index 
(poor guy), he has the shortest list. Even he has an infinitely long list of potential 
wives. In fact, every man’s list contains exactly No women. 

We will show that (JV, W) is not espousable. By way of contradiction, suppose 
that f : M — W(M) is a one-to-one function. Beginning with any a such that 
w <a < w4, build a sequence of men as follows. Start with mq. If f(ma) = ys 
and 3 > w, add mg to the left of mq. If f(m+) = ya, add m, to the right of ma. 
Since f maps men to women with smaller indices, we must have G < a < ¥. 
Continue in this fashion, adding men to the right of the right end of the sequence 
and to the left of the left end of the sequence for as long as possible. Since the 
sequence to the left gives a descending sequence of ordinals, the sequence must 
have a left termination point. The only way for the sequence to terminate is if the 
last man is mapped to a woman with an index less than w. Thus every sequence 
constructed in this fashion terminates on the left with a man married to some 
woman y; with k < w. Because f is one-to-one, no two sequences can terminate 
with men married to the same woman, so each sequence terminates on the left 
with a man married to a unique woman y;, with k < w. This shows that there are 
countably many sequences. The sequence for any particular man m, could extend 
infinitely to the right. However, since there is no limit stage in the construction, 
the sequence to the right will have at most No elements. Thus we have countably 
many sequences, each containing countably many men. Every man is in one of 
these sequences, so there are countably many men. However, we said that the set 
of men is M = {mq | w < a < w }, which is uncountable. This contradiction 
shows that the society (IV, W) is not espousable. 

If we could apply Theorem 3.43 here, then (IV, W) would not be c-admissible. 
However, (1/,W) is c-admissible, as the following argument shows. Suppose 
J C M is any subset of men such that the subsociety (J, W) is espousable. Let 
f : J — W(J) be a one-to-one function espousing (J, W). On the one hand, 
suppose that for some m. € J, f(ma) = yg and 8+ 1 < a. We can modify 
the solution f by setting f(ma) = yg+1 and, if necessary, marrying the former 
husband of yg+i to yg. In this case, f is not a unique solution to the marriage 
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problem. On the other hand, if f(m™.) = yg always implies G + 1 > a, then 
nobody is married to yo. Thus, we can modify f by marrying someone to yo. 
In either case, the solution of (J, W) is not unique, so (IM, W) is c-admissible. 
Similar proofs could be used to show that (IW, W) is q-admissible, j:-admissible, 
and satisfies item | of Theorem 3.42. Thus none of the results of the previous 
section remain true when the hypothesis of the countability of MM is removed 
from their statements. 

In [2] and [1], Aharoni, Nash- Williams, and Shelah present a marriage theorem 
that does allow uncountable populations of men. Unlike Theorem 3.41, there is 
no restriction on the lengths of the lists of the men. Thus the hypothesis for the 
statement is particularly brief. Here is the theorem. 


Theorem 3.44. Suppose (M,W) is a society. The following are equivalent: 


1. (M,W) is espousable. 


2. For every cardinal k such that0 < & < No or & is regular, there is no 
«-obstruction in (M,W). 


Since the proof of this theorem in [1] is twenty-five pages long, we will not 
replicate it here. However, to understand the statement of the theorem, we should 
unravel the definition of a K-obstruction. This will require application of some of 
our knowledge of cardinal numbers. Item 2 above mentions finite cardinals, which 
are just natural numbers, and regular cardinals, which are defined in Section 3.5.3. 
Good examples of regular cardinals include No and &,. By contrast, X., is good 
example of a singular (i.e. not regular) cardinal. 

Every «-obstruction turns out to be a subsociety of (IV, W), and some new 
notation will prove helpful. In this setting, it is easiest to identify (IZ, W) with 
the bipartite graph with vertices in 17 and W(M/). Given some subset A of the 
vertices (people) in (IM, W), we can easily construct the subsociety (which we 
will also denote with A) corresponding to the subgraph of (7, W) with vertices 
in A. We can also form (IV, W) — A, the society whose graph is the subgraph 
of (IM, W) on the vertices that do not appear in A. We say that A is a saturated 
subsociety of (7, W) if whenever m € A, W(m) CA. 

Our definition of «-obstructions is a transfinite recursion with infinitely many 
base cases. Here are the base cases. Suppose that 0 < K < No or K = No. We say 
that A is a «-obstruction of (17, W) if A is a saturated subsociety and there is a 
set L C M with |Z] = « such that A — L is espousable, but for every woman 
w € A, the society A — (LU {w}) is not espousable. Informally, if A is a K- 
obstruction (for « < No), then L is a set of excess men preventing A from having 
an espousal. The requirement that A is saturated transfers the unespousability of 
A to the society (17, W). Thus if (7, W) has a «-obstruction for & < No, then 
(MM, W) is not espousable. 

To define «-obstructions for uncountable regular cardinals «, we will need to 
review and introduce some set theoretic terminology. In Section 3.6.3 we said that 
a set CC x is club (shorthand for closed and unbounded) if 
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e the limit of each sequence of elements in C is either « or in C’, and 


e Vac KABEeC(a< #). 


We say that a set S' C « is stationary if for every club C C k, we have SNC F 0). 
To use a topological metaphor, stationary sets meet every club set. 

Now we are ready for the inductive step in the definition of «-obstructions of 
(M, W). Suppose that « is regular, and we have defined p-obstructions for every 
Ju < & such that 0 < yw < No or p is uncountable and regular. We say that a 
subsociety A of (V,W) is a «-obstruction of (17, W) if there is a transfinite 
sequence (A..)a<x of disjoint subsocieties such that 


e A= nega: 


e for each a < &, either Ay is a single woman, or for some 3 < xk, the 
subsociety A, is a 3-obstruction in (17, W) — UgcaAg, and 


e the set of a < « for which A, is not a single woman is stationary. 


This completes the definition of «-obstructions for all those cardinals « such that 
0<K« < No ore is regular. 

At the beginning of this section we showed that the society (7, W) that has 
M ={ma |w <a < wi} andW(ma) = {yg | 6 < a} for all infinite a < w) 
is not espousable. In light of Theorem 3.44, this society must have a «-obstruction 
for some « that either satisfies 0 < & < No or is a regular cardinal. In fact, we can 
find an 8-obstruction for (/, W). 

We define the §1-obstruction for (17,W) by applying transfinite recursion. 
Let Ao = {yo}. For each successor ordinal a + 1 < Ny, let Agsi = {Ya41}. 
Finally, if A < jy is a limit ordinal, let Ay = {m),m)+1, yy}. This defines a 
sequence (Aq) qc, of disjoint subsocieties of (IZ, W). Let A = Uncr, Aa. We 
must use the definition of a «-obstruction to prove that A is an §j-obstruction. 
For each a < Nj, either the subsociety A, is a single woman or a is a limit. 
If a is a limit, then Ag = {ma,™Ma+41, Ya}. Since Ya is the only woman in 
(M, W) — Up<aAg with an index less than a + 1, Ag is a saturated subsociety 
of (M,W) — UgeaAg. Furthermore, the subsociety A,, — {m,,} is espousable, 
but Ay — {ma, Ya} is not. Thus if a is a limit, then A, is a 1-obstruction of 
(M,W) — Ugeao. Finally, the set of a < 1 such that A, is not a single 
woman is exactly the set S = {\ < §; | \ isa limit}, which is stationary. (This 
is an exercise.) This completes the verification that A is an %,-obstruction of 
(M,W). 

If we think of a society as a bipartite graph, then every «-obstruction is a sub- 
graph of the society. With this viewpoint, Theorem 3.44 states that the graph for 
a society has a matching of the men into the women if and only if it contains no 
subgraphs of a particular form. Thus, Theorem 3.44 is a “forbidden subgraph” 
characterization of all the espousable societies. 
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Exercises 


1. Prove that if « is an uncountable regular cardinal, then the set of ordinals 
{\ < «| Ais a limit} is stationary. 


2. Prove that if a society (17,W) has an No-obstruction, then it also has a 
1-obstruction. 


3. Suppose that (17, W) is the society with 
M ={m,|a< wt, 
W (ma+1) = {Wa} for each a < w, and 
W(mo) = {Wa | a < wh. 
Complete the following. 


(a) Find a 1-obstruction for (17, W). 
(b) Prove that (V7, W) has no 2-obstructions. 


4. Suppose that (1/7, W) is the society with 


M ={ma|w<a<auy}, 
W(M) = {ya | a < wy}, and 
VYma € M W(ma,) = {ye | 8 < ah. 


Complete the following. 


(a) Prove that (IV, W) has no 1-obstruction. 


(b) Prove that (IZ, W) has no No-obstruction. (Hint: You could do and 
apply Exercise 2.) 


(c) Prove that (I, W) is p-admissible. 
(d) If you survived part 4c, then prove that (7, W) is p-admissible. 


3.8.4 Espousable Cardinals 


It is easy to deduce from this that weakly compact cardinals are very 
large... 
—E.C. Milner [206] 


For the past two sections we have been modifying our marriage theorems in order 
to make them hold for larger and larger cardinals. This process has been pretty 
successful, resulting in Theorem 3.44 that can handle arbitrarily large societies. 
However, the statement of Theorem 3.44 is substantially more complicated than 
that of Marshall Hall Jr.’s Theorem 3.41. An alternative approach is to concoct a 
natural simple generalization of Theorem 3.41 and ask which cardinals satisfy it. 
This is the motivation for the following definition. 
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Definition. An infinite cardinal « is espousable if every society (IM, W) that 
satisfies the following criteria is espousable. 


1. |M| =k, 
2. ¥m € M |W(m)| < «, and 
3. if Mo C M and |Mp| < «, then (Mo, W) is espousable. 


Informally, “« is espousable” means that a result very like Theorem 3.41 holds 
for all societies of size x. There is some variation from the theorem. The third 
clause requires espousability of all small subsocieties where Theorem 3.41 just 
specifies that there is a sufficiently large number of women. This modification 
is necessary, since the unespousable uncountable society from the beginning of 
Section 3.8.3 satisfies the cardinality requirements from Theorem 3.41 and could 
be embedded in any uncountable society. 

From Theorem 3.41, we know that No is espousable. The unespousable and 
uncountable society from the beginning of Section 3.8.3 shows that Nj is not 
espousable. (You may wish to check the details of these claims as exercises.) Thus 
the espousable cardinals form a proper subclass of the class of all cardinals. The 
following two theorems of Shelah [252] give additional properties of espousable 
cardinals. 


Theorem 3.45. /f & is an uncountable espousable cardinal, then k is a limit 
cardinal. 


Proof. It is easiest to show that if « is an uncountable successor cardinal, then « 
is not espousable. We will show that a particular uncountable successor cardinal, 
No, is not espousable, and leave the generalization of the argument as an exercise. 

Our goal is to show that XN» is not espousable. We modify the example from the 
beginning of Section 3.8.3 as follows. Let 


M = {ma |i < a < No}, 
W(M) = {ya | a < No}, and 
for each ma € M, W(ma) = {ye | B < a}. 


We must verify that (17, W) satisfies all the criteria in the definition of espousable 
cardinal. The cardinality of M is Ng and for each man mq, |W(ma)| = Xi < No. 
Furthermore, if Mp C M and |Mo| < Ne, then |Mo| = 1, so any bijection 
between Mp and {yq | @ < N,} is an espousal of the subsociety (Mo, W). 
Summarizing, (17, W) satisfies all the criteria listed in the definition of espous- 
able cardinals. However, the argument on page 337 can be adapted to show that 
(MM, W) is not espousable. Thus X2 is not an espousable cardinal. Generalization 
of this argument to other successor cardinals is left as an exercise. Oo 


Theorem 3.46. [f « is an uncountable espousable cardinal, then k is regular. 
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Proof. As in the preceding theorem’s proof, it is easiest to show that if & is both 
uncountable and not regular then & is not espousable. Again, we will prove the 
theorem for a particular case and leave the generalization as an exercise. 

Suppose that « is X.,. We know that X,, is not regular because UgewXa = Nw- 
We will show that X,, is not espousable by gluing together versions of the society 
from the end of Section 3.8.1. Let (V7, W) be the society defined by: 


M = {p}U{ma | a< Xu}, 
W(M) = {we | a < Xo}, 


W (ma) = {Wa} ifa g {Ry | n € wh, 


mr, ) = {we | Rn < B < Rnaits and 


( 
( 
w( 
( 


W (p) = {wx,, |r € w. 


It is easy to verify that (7, W) satisfies the first two criteria in the definition of 
an espousable cardinal. Suppose that Mp C M and |Mo| < Nw. If p € Mo then 
matching each man to the woman with his index yields an espousal. Ifp € Mo, we 
must work a little harder. Find the least 7 with Mp DZ {ma | Nj < a < Nj41}, and 
let a be some ordinal in [Nj, X41) such that ma, ¢ Mo. If we let f(p) = wx,, 
f(mxy,;) = Wao, and f(mg) = we for all other 8 € Mo, then F is an espousal 
of the subsociety for Mo. If 8, was espousable, then (IV, W) would have an 
espousal. However, any assignment of a wife to man p leads to a contradiction. 
Thus &,, is not an espousable cardinal. To generalize this argument to other non- 
regular cardinals, just replace No, Ni,... with any cofinal sequence witnessing 
that the cardinal under consideration is singular. O 


On the basis of the preceding two theorems, we know that if « is an uncountable 
espousable cardinal, then « is an uncountable regular limit cardinal, and so & is 
weakly inaccessible. Applying Theorem 3.35, ZFC cannot prove the existence of 
an uncountable espousable cardinal. However, the following theorem shows that 
espousable cardinals are not too high in our hierarchy of large cardinals. (Weakly 
compact cardinals are very large, but not too large.) 


Theorem 3.47. Every weakly compact cardinal is espousable. 


Sketch of proof. Suppose that « is weakly compact and (1/7, W) is a society that 
satisfies the criteria in the definition of espousable cardinals. Without any loss of 
generality, let M = {mq | a < «}. Build a tree of height « in which the nodes 
at level a of T are labeled with the names of women in W(m,), and each path 
through the tree is an initial segment of an espousal of (MV, W). Since « is weakly 
compact, it is a regular strong limit cardinal. This can be used to prove that each 
level of T has size less than &. Applying Theorem 3.39, use the tree property 
for « to find a path through T of length «. This path is the desired espousal of 
(M,W). O 
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We now have both upper and lower bounds on the size of the least uncountable 
espousable cardinal, but there are some obvious questions. Is every uncountable 
espousable cardinal a strong limit cardinal? Does every uncountable espousable 
cardinal have the tree property? Is every uncountable cardinal with the tree prop- 
erty espousable? At this writing, these questions appear to be open. 


Exercises 
1. Show that No is espousable. 
2. Show that Xj is not espousable. 


3. Complete the proof of Theorem 3.45 by generalizing the argument to other 
successor cardinals. 


4. Complete the proof of Theorem 3.46 by generalizing the argument to other 
singular cardinals. 


5. Fill in the details of the proof of Theorem 3.47. 


3.8.5 Perfect Matchings 


T actually found someone for me. 
— Whitney Houston 


A society is called symmetrically espousable if it has an espousal that maps the 
men onto the women. What a difference one word makes! Requiring the espousal 
to be onto insures that every person in the society will have a spouse, without 
regard to their gender. If we view societies as bipartite graphs, an onto espousal 
gives a perfect matching between the two sets of vertices. 

A bit more notation makes it easy to formulate symmetric marriage theorems. 
A symmetric society is a quadruple (M, W, W, M) such that (M, W) is a society 
consisting of the men and their lists, (W,, 17) is a society consisting of the women 
and their lists, and the lists match appropriately. More precisely, we require that a 
woman is on a man’s list if and only if he is on her list, so w € W(m) if and only 
if m € M(w). We can now formulate and prove a symmetric version of Theorem 
3.41. 


Theorem 3.48. Suppose (M,W,W,™M) is a symmetric society in which each 
person has a finite list. More formally, for every m € M and w € W, we have 
|W (m)| < Xo and |M(w)| < Xo. Then the following are equivalent: 


1. For every S C M, |S| < |W(S)| and for every T c W, |T| < |M(T)|. 


2. (M,W,W,M) is symmetrically espousable. 


Proof. To prove item 2 implies item 1, it suffices to note that if (MM, W, W, M) is 
symmetrically espousable, then (/,W) and (W, M) are espousable. Applying 
Theorem 3.41 twice yields item 1. 
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To prove the converse assuming item 1, Theorem 3.41 says that the societies 
(M,W) and (W, M) have espousals. Thus we have one-to-one functions from 
the men into the women and from the women into the men. The statement of the 
Cantor—Bernstein Theorem (Theorem 3.20) says that we must have a bijection 
between the men and the women. This does not quite finish the proof, since an 
arbitrary bijection might not map a person to someone on their list. However, the 
proof of Theorem 3.20 shows that this bijection can be constructed using only 
values from the initial injections. In our setting, suppose f : M — W is an 
espousal of the men, g : W — M is an espousal of the women, and h : M > W 
is the bijection constructed as in the proof of Theorem 3.20. Then h(m) = w 
implies that either f(m) = w or g(w) = m. Thus, either w is on m’s list or m is 
on w’s list. By the symmetry of the society, each must be on the other’s list. Thus 
h is a one-to-one and onto map between the men and the women that matches 
each person with someone on their list. O 


The preceding theorem and proof can be adapted to formulate and prove new 
symmetric versions of all of our marriage theorems. We leave these results as 
exercises. 


Exercises 


1. Formulate and prove a symmetric version of the finite marriage theorem, 
Theorem 3.40. 


2. Formulate and prove a symmetric version of the theorem of Damerell and 
Milner, Theorem 3.42. 


3. Formulate and prove a symmetric version of the countable omnibus mar- 
riage theorem, Theorem 3.43. 


4. Formulate and prove a symmetric version of the uncountable marriage the- 
orem, Theorem 3.44. 


5. Define symmetrically espousable cardinals. Using your definition, is there 
a cardinal that is espousable but not symmetrically espousable? 


3.9 Finite Combinatorics with Infinite 
Consequences 


Does mathematics need new axioms? 
— Solomon Feferman [104] 


In this section we will discuss a remarkable result due to H. Friedman [113]. 
Friedman has concocted a finite combinatorial statement that he calls Proposition 
B. Under the appropriate consistency assumptions, he shows the following: 
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1. For each k € w, the axiom system consisting of ZFC plus “there is a k- 
subtle cardinal” cannot prove Proposition B. 


2. The axiom system consisting of ZFC plus “for every k € w there is a k- 
subtle cardinal” can prove Proposition B. 


Thus, the use of large cardinals is required in the proof of the finite combinatorial 
statement of Proposition B. 

Proposition B is not simple, but it can be understood with a modest effort. 
We will start with the full statement, and then gradually define any mysterious 
terminology. 


Proposition B. For every k > 0 and p > 0, there is an integer n such that if 
{fx |X C [n]*} is any #-decreasing function assignment for |n]*", then we can 
find sets A and E satisfying 


e Eis subset of {0,1,2,...,n} with p elements, 
e [E|\k CAC [n]*, and 
e fa has no more than k* regressive values on |E\*. 


In the statement of the proposition, p, n, and k represent natural numbers. 
Here, the notation [n]* denotes the set of all ordered k-tuples with coordinates 
i {01% ci. 1) 

If x is a k-tuple, then |a| is the maximum coordinate of x and min(zx) is the 
minimum coordinate. For example, if « = (1,3,0), then |2| = 3 and min(a) = 0. 
If f is a function mapping k:-tuples to k-tuples, we say that y is a regressive value 
for f on [E]* if for some x € [E]*, y = f(x) and |y| < min(z). For example, 
if f(7,6,5) = (1,3, 2), then (1,3, 2) is a regressive value for f because 3 < 5. 
A single regressive value may have many witnesses. The function g : [8]? — [8]° 
defined by g(x,y, z) = (0,0,0) for all 0 < x,y,z < 8 has only one regressive 
value, namely (0, 0,0). 

A function assignment on [n]" assigns one function to each nonempty subset 
X C [n]*. The function assigned to X is required to map X to _X, so we always 
have fx : X — X. We say that the function assignment {fx | X C [n]*} is 
#-decreasing if whenever A C [n]* and x € [n]*, either 


¢ faufe} and f, agree on all elements of A, or 


e there is a y such that |y| > |x| and |fa(y)| > |fausa}(y)|- 


Some sense of the nature of #-decreasing function assignments can be gained 
from looking at a very small example. Suppose we set & = 1 and consider the 
possible function assignments for [2]'. Since we are required to set f;9}(0) = 0 
and f;;;(1) = 1, every function assignment on [2]! is completely determined by 
the values of fo,1}. We will look at all four possible cases. 


Case 01: Suppose fo,1;(0) = 0 and fro13(1) = 1. Since fro,1} extends 
both f¢o; and f,;}, this is a #-decreasing function assignment. 
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Case 00: Suppose fro,1;(0) = 0 and fro1;(1) = 0. Since fro,1} extends 
feo} and friy(1) = 1 > 0 = feryuso}(1), this is also a #-decreasing 
function assignment. 


Case 10: Suppose f¢9,1;(0) = 1 and fo,1;(1) = 0. This is an acceptable 
function assignment, but it is not #-decreasing. If we let A = {0} and 
set x = 1, then faus2}(0) = 1 # 0 = fa(0), so the first clause of the 
definition of #-decreasing fails. Because x = 1, there is no y € A such 
that |y| > |x], so the second clause fails also. 


Case 11: Suppose f9,1;(0) = 1 and f9,1;(1) = 1. This is an acceptable 
function assignment. Because f;o,1;(0) = 1, imitating the argument in the 
preceding case will show that this function assignment is not #-decreasing. 


In general, the second clause of the definition of ##-decreasing function assign- 
ment forces the values of the functions to be pushed down. One might think that 
this would always lead to a proliferation of regressive values. But Proposition B 
asserts that if we start with a large enough domain, then even when the function 
assignment is #-decreasing there will be a function on a large subset that is not 
regressive at too many values. 

Proposition B is a remarkable statement. Perhaps it is too remarkable to be 
true. It is important to remember that proving the proposition inherently requires 
large cardinal assumptions that are well beyond the strength of the axioms of 
ZFC. Should we automatically tack these large cardinal axioms onto our collec- 
tion of everyday set theory axioms? Feferman [104] would consider this rash. 
Does Friedman’s result offer novel insights into the consequences of assuming 
large cardinal axioms? Absolutely. 


Exercises 


1. Let I denote the function assignment on [n]* defined by setting f(x) = x 
for every A and every x € A. Show that J is a #-decreasing function 
assignment. 


2. Let M denote the function assignment on [n]! defined by setting f4(x) = 
min(A) for every A and every x € A. 


(a) Show that / is a #-decreasing function assignment. 


(b) If f4 € M, what is the maximum number of regressive values that f 4 
can have? What is the maximum number of witnesses that a regressive 
value for f.4 can have? 


3. Let S' denote the function assignment on [n]! defined by setting 


x ifa < |Al, 
fal@)=9_ is 
min(A) ifa=|Al, 


for every A and every x € A. Is S a #-decreasing function assignment? 
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4. Find a function assignment on [3]! that is not #4-decreasing. If you would 
like a challenge, find all of them. 


3.10 k-critical Linear Orderings 


It appears that Feferman is using the word ‘need’ in a sense that 
requires discussion. 
— Harvey Friedman [105] 


In [114], H. Friedman reveals the connection between k-subtle cardinals and a 
very natural Ramsey-style property on linear orderings. These new results are not 
expressions of finite combinatorics like those of Section 3.9, but they have the 
advantage of requiring less terminology. 

Suppose that (X, <) is a linear ordering. As you may recall from Section 3.5.2, 
this means that the relation < is antisymmetric, transitive, and satisfies the tri- 
chotomy law. We say that X has no endpoints if it has neither a maximum nor a 
minimum element. The ordinal w is certainly a linear ordering, with 0 as an end- 
point. The integers are a familiar example of a linear ordering without endpoints, 
as are the rational numbers and the real numbers. 

Suppose that (X,<) is a linear ordering without endpoints and that k > 0 
is a natural number. We say that f : [X]* — X is a regressive function if for 


every increasing k-tuple 71 < %2 < +++ < &, we have f(21,%2,...,@%) < %1. 
A linear ordering is k-critical if for every regressive function f : [X]* > X 
there is a sequence of elements b; < --- < bx41 such that f(b1,...,b%) = 
f (be,..., be4t). 


This is all the machinery that we need to state (some of) Friedman’s results! 
Here are two of his theorems. 


Theorem 3.49. For each nonzero k € w, ZFC proves that k is the least cardinality 
of a (k + 1)-critical linear ordering if and only if k is the least k-subtle cardinal. 


Theorem 3.50. ZFC proves that there is a k-critical linear ordering for every 
nonzero k; € w if and only if there is a k-subtle cardinal for every nonzero k € w. 


For proofs of these theorems and many related results, see [114]. It is worth 
noting that Theorem 3.49 could be reformulated without using the word “least.” 
Working in ZFC, suppose there is a (k + 1)-critical linear ordering X. Let \ = 
|X |. Applying the axiom of separation we can form the subset S of those cardinals 
a < A such that there is a (k+1)-critical linear ordering Y with |Y| = a. Since S 
is a set of cardinals, it has a least element; call it «. Then « is the least cardinality 
of a (k+1)-critical linear ordering. Summarizing, if there is a (k+1)-critical linear 
ordering, then there is a least cardinality of a (& + 1)-critical linear ordering. The 
converse of this implication is obvious. A similar argument based on the definition 
of subtle cardinals shows that there is a k-subtle cardinal if and only if there is a 
least k-subtle cardinal. Consequently, Theorem 3.49 could be stated as “there is a 
(k + 1)-critical linear ordering if and only if there is a k-subtle cardinal.” 
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Is it reasonable to assert that k-critical linear orderings exist? Familiar linear 
orderings like the integers, the rationals, and the reals, are much too small to be 
even 2-critical. (See the exercises.) The smallest 2-critical linear ordering must be 
larger than some weakly compact cardinal. On the other hand, the least Ramsey 
cardinal is larger than some k-critical linear ordering for every k. If we believe 
that k-subtle cardinals exist, then we must also accept the existence of k-critical 
linear orderings. If we reject the existence of weakly compact cardinals, then we 
must also reject the existence of k-critical linear orderings. The existence of these 
objects is independent of ZFC (assuming ZFC is consistent), so we are back to 
deciding whether or not to add new axioms. 

We can be certain about one thing. Friedman’s theorems give us a characteri- 
zation of k-subtle cardinals that does not mention (n, &)-sequences or club sets. 
This elegant description of k-subtle cardinals could be a help in discussions about 
their existence. 


Exercises 


1. LetZ = {...,-—2,—1,0,1,2,...} denote the set of all integers and define 
f:Z— Zby f(x) = x — 1. Show that f is regressive. 


2. Prove that Z is not 2-critical. (Hint: Problem | is useful.) 
3. Let Q denote the rational numbers. Show that Q is not 2-critical. 
4. Let R denote the real numbers. Show that R is not 2-critical. 


5. Prove that no countable linear ordering with no endpoints is 2-critical. 
(Hint: Any such linear ordering has countably many pairs and also has a 
countable unbounded descending sequence. These can be used to build the 
needed regressive function.) 


6. Suppose that « > No and « is regular. Prove that if is a linear ordering of 
cardinality « containing an unbounded descending sequence of size «, then 
L is not 2-critical. 


7. (This one is very challenging.) Without using Friedman’s theorem, try to 
prove that if & is a Ramsey cardinal then there is a 2-critical linear ordering 
of size K. 


3.11 Points of Departure 


Where do you want to go today? ® 
— Microsoft 


This chapter can be summarized as a study of pigeonhole principles, K6nig’s 
Lemma, and Ramsey’s Theorem and their connections to cardinal numbers. Our 


3.11 Points of Departure 349 


slightly obsessive focus on cardinals is not the only approach to studying infinite 
objects. In this section we take brief peeks at computability, reverse mathematics, 
and the analytical hierarchy. For each topic area, we outline an application to 
graph theory and combinatorics, and state an open question. The section closes 
with a list of Ramsey-style theorems and named trees. 


Computability 


We say that a function f : N — N is computable if there is a program written 
in C that given an input of n always halts with output f(n). Any programming 
language can be substituted for C. A set M C N is computable if its characteristic 
function x,y is a computable function. The function yj, is defined by setting 
xm(n) = Lifn € M and y(n) = Oif n ¢ M. Our definition extends in 
the obvious way to handle functions on N* and subsets of N*. The computable 
functions are sometimes called Turing computable functions or general recursive 
functions. The study of these functions is called computability theory or recursion 
theory. Good introductory texts include [31] and [259], and [120] is a survey of 
computability in graph theory and combinatorics. 

Every computable function has a program. We can think of this program as an 
integer; it certainly would be stored in a computer as a string of zeros and ones that 
could be thought of as a single big integer. We use {m}(n) = k as shorthand for 
saying that when the machine with code m receives input n, it halts with output 
k;, We frequently refer to these codes as indices for computable functions. 

Not every subset of N is computable. For example, consider the set 


H={meEN|{m}(m) = 0}. 


To show that H is not computable, we consider yz and prove the following the- 
orem. 


Theorem 3.51. \ 4 is not computable. 


Proof. Suppose that yz is computable. Let m be the code for a program that 
computes \ 7. Then 


XH(m)=0 + {m}(m)=0 
o meH 
= xXH(m) = 1. 
Since 0 ¥ 1, yz is not computable. O 


Because computer programs process in discrete steps, we can examine compu- 
tations at various stages. We write {m}(n) |, if the program with index m and 
input 7 halts in fewer than s steps. We also write {m}(n) | if the program even- 
tually halts. It is easy to mechanically check whether a program halts in s steps, 
but impossible to mechanically determine which programs will eventually halt. 
Indeed, the set H is referred to as the solution to the self-halting problem. 
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Using our extended shorthand, we can encode noncomputable sets like H in 
graph theory problems. For example, suppose we define a coloring F on [N]? by 
the following rule. Let {7, 7, k} be any three-element subset of N, assume that we 
havei < 7 < k, and let 


ie oye ie (Vom <i) {m}(ma) Lie {mm} (rn) Le 
blue otherwise. 

Thus F'(2, 7, k) is red when for every machine with a code m < i, we get the same 
information about whether or not {m}(m) halts by checking 7 steps as we do by 
checking k steps. If F'(i, j, &) is blue, then for some m < i the machine {m}(m) 
halts between step 7 and step k. By Ramsey’s Theorem, F' must have an infinite 
monochromatic set; call it T = {to,t1,t2,...} and assume that we have listed 
the elements in increasing order. No matter how we select three elements from 
T, applying F' should always give us the same color. Since there are only finitely 
many machines with codes less than to, we cannot have one of them stopping 
between ¢, steps and t;;1 steps forevery7 > 1. (Smell any pigeons?) Thus 7’ must 
be a red monochromatic set. The elements of T’ are very handy for computing the 
set H. Pick any machine m. We know that m < t,,, and if {m}(m) ever halts, 
it must halt by step t,,41. To see this, suppose that {m}(m) halts at a later step 
k. Then ty41 > k and F'(tm,tm+1,tk+1) would be blue, contradicting the claim 
that T’ is a monochromatic red set. Thus, using yr as a subprogram, we could 
easily write a program that computes x77. Since yz is not computable, yr is not 
computable either, and we have proved the following theorem. 


Theorem 3.52. There is a computable coloring of {N]° that has no infinite com- 
putable monochromatic set. 


Many other types of coloring problems lead to noncomputable sets. For ex- 
ample, there is a computable 2-regular 2-colorable graph with no computable 
2-coloring. However, bumping up the size of our color palette can lead to com- 
putable colorings. Schmerl [248] proved that if G is a computable d-regular graph 
that is n-colorable, then G' has a computable 2n — 1 coloring. Depending on the 
value of d, it may or may not be possible to find a computable coloring with fewer 
than 2n — 1 colors. For 2 <n < m < 2n — 2, let D(n,m) be the least degree 
d such that there is a d-regular n-colorable graph with no recursive m coloring. 
Beyond a few bounds (see [120] and [248]), little is known about the values of 
D(n,m). For example, we know that 6 < D(5,5) < 7, but the exact value of 
D(5,5) is not known. 


Reverse Mathematics 


Reverse mathematics is a program of mathematical logic that was founded by 
H. Friedman (as in Section 3.9) and S. Simpson. The goal of the program is to 
measure the logical strength of mathematical theorems by proving that each one is 
equivalent to some statement in a hierarchy of axioms for second order arithmetic. 
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These proofs are carried out in a weak base system, RCAg, which consists of PA 
with restricted induction plus a comprehension axiom that essentially asserts the 
existence of computable subsets of N. We refer to these systems as second order 
arithmetic because the formulas include variables for numbers and variables for 
sets of numbers. Simpson’s book [254] is a comprehensive source for information 
about reverse mathematics. A number of theorems about infinite graphs have been 
analyzed, including the following example. 


Theorem 3.53. RCAg can prove that the following are equivalent: 
I. Every 2-regular graph with no cycles of odd length is bipartite. 


2. Weak Konig’s Lemma: Every infinite tree in which every node is labeled 0 
or | contains an infinite path. 


Since a graph is bipartite if and only if it is 2-colorable, it is not hard to adapt 
the solution of Exercise 3 of Section 3.1 to prove that 2 implies 1. The proof that 1 
implies 2 is less obvious, and the published proof uses an intermediate equivalent 
statement [162]. 

There are many open questions in reverse mathematics that are related to com- 
binatorics and graph theory. For example, can RCAg prove that Ramsey’s Theo- 
rem for pairs and two colors implies Weak K6nig’s Lemma? It is known that the 
converse is independent of RCAg and that Ramsey’s Theorem for triples and two 
colors implies a much stronger version of K6nig’s Lemma. 


The Analytical Hierarchy 


Subsets of N that are definable by formulas of second order arithmetic are called 
analytical, and can be organized by the complexity of their defining formulas. 
The resulting hierarchy of sets is analogous (but not identical) to the hierarchy of 
projective sets studied by descriptive set theorists. Both [240] and [161] provide 
useful background for the study of the analytical hierarchy. 

To state any results, we need some terminology. A set is “}-definable if it 
is definable by a formula containing no universal set quantifiers. We say that a 
set is U}-complete if it is U}-definable and every other “}-definable set is 1- 
reducible to it. Here, B is 1-reducible to A if there is a computable one-to-one 
function f such that for all n,n € B if and only if f(n) € A. Naively, each ©}- 
complete set embodies all the information content of every other “'}-definable 
set. One interesting characteristic of these sets is that any defining formula for a 
xt-complete set must contain a set quantifier. Thus, no formula containing only 
quantifiers on numbers can possibly define a ©t-complete set. 

An infinite graph G' has a Hamiltonian path if there is a sequence of vertices 
Uo, V1, V2,... Such that each vertex of G appears exactly once in the list and 
v;U;+1 is an edge of G for each 7 € N. An index for a computable graph is the 
code for the characteristic function of the edge and vertex sets for the graph. Using 
all this terminology, we can state the following theorem of Harel [153]. 
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Theorem 3.54. The set of indices of computable graphs with Hamiltonian paths 
is S}-complete. 


For a similar problem, suppose we weight the edges of an infinite graph with 
rational numbers that sum to 1. Such a graph may or may not have a minimal 
spanning tree. Is the set of indices of computable graphs with minimal spanning 
trees a }t-complete set? This question was motivated by a related reverse mathe- 
matics result for directed graphs [57], and the answer was unknown when the first 
edition of this book was published. Since that time, Schmerl [249] has shown that 
the answer is no, the set of indices of computable graphs with minimal spanning 
trees can be defined by an arithmetical formula. 


Lists of Theorems and Trees 


There are a number of interesting Ramsey-style theorems that are worthy of ex- 
ploration. Here is a list, with pointers to some good references. 


e Van der Waerden’s Theorem (and the related Szemerédi’s Theorem) [136] 
e Hindman’s Theorem (and the related Folkman’s Theorem) [136] 

e Milliken’s Theorem [205] 

e Galvin—Prikry Theorem [119] 

e Erd6s—Rado Theorem [168] 


Other good prospects for further study include special and regular Aronszajn 
trees, Suslin trees, and Kurepa trees, all appearing in [168] and [187]. 
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His was a name to conjure with in certain circles. 
— E. Wallace, The Just Men of Cordova 


In this section we will name a few authors whose books will be helpful to those 
interested in the further study of infinite sets, graphs, and combinatorics. Many 
good introductory texts on axiomatic set theory are available. For example, the 
books of Enderton [88], Moschovakis [209], and Roitman [241] are all accessible 
and remarkably distinct. For a more technical approach, Drake [80] and Levy 
[187] are good choices. Jech’s encyclopedic text [168] is an invaluable reference 
(and a good read). 

Several books give extended treatments of some specific topics in this chapter. 
Devlin’s book [73] gives a comprehensive coverage of constructible sets. Large 
cardinals and partition cardinals play a central role in Drake’s text [80]. The Axiom 
of Choice is the title and subject of another nice book by Jech [167]. For a detailed 
treatment of cardinal and ordinal arithmetic, it is hard to beat Sierpifiski’s old gem 
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[253]. It is very interesting to compare Sierpifski’s development with that of the 
ante-ZFC papers of Cantor, [47] and [48], both translated in [49]. 

Much of the historical content of the chapter was gleaned from van Heijenoort’s 
anthology [272] and Kanamori’s insightful article [173]. For the final word in 
many matters of logic, one can consult Kleene’s blue bible [176] or the more 
accessible text of Mendelson [201]. Finally, other treatments of infinite graphs 
and combinatorics can be found in the books of Ore [218] and Diestel [74]. 
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